GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 

Run on: March 5, 2004, 16:14:14 ; Search time 138 Seconds 

(without alignments) 
282.548 Million cell updates/sec 


Title: US-10-057-890A-10 
Perfect score: 797 

Sequence: 1 MDYQVSSPIYDINYYTSEPC GLNNCS S SNRLDGHQRVHAA 138 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1586107 seqs, 282547505 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


1586107 


Database : A_Geneseq_2 9 Jan04 : * 

1 : geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2000s : * 

4 : geneseqp2001s : * 

5: geneseqp2002s : * 

6: geneseqp2003as : * 

7 : geneseqp2003bs : * 

8 : geneseqp2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 


SUMMARIES 

% 

•Result Query 

No. Score Match Length DB ID Description 


1 

797 

100 

.0 

138 

5 

ABG32539 

Abg32539 

Human 

CCR 

2 

797 

100 

.0 

157 

5 

ABG32540 

Abg32540 

Human 

CCR 

3 

379 

47 

.6 

352 

2 

AAW27407 

Aaw27407 

Human 

CCR 

4 

379 

47 

.6 

352 

2 

AAW27123 

Aaw27123 

Human 

che 

5 

379 

47 

.6 

352 

2 

AAW23835 

Aaw23835 

Human 

CC 

6 

379 

47 

.6 

352 

2 

AAW88232 

Aaw88232 

HIV-1 

co- 

7 

379 

47 

.6 

352 

4 

AAE07048 

Aae07048 

Human 

G-p 

8 

379 

47 

.6 

352 

4 

AAG80111 

Aag80111 

Human 

CCR 

9 

379 

47 

.6 

352 

4 

AAE04321 

Aae04321 

Human 

che 


10 

379 

47 . 

6 

352 

4 

AAE 0 / U 3 y 

J-\o.kz \J / U o _? 

numan o p 

11 

379 

47 . 

6 

352 

4 

AAB4 dodo 

r\d U4DO JO 

Unman Hnfi 

numan nuo 

12 

379 

47 . 

6 

352 

4 

AddjDo4Z 

Ahh 9 

NT ^ n — onHnn 
iNuii c i 

13 

379 

47 . 

6 

352 

4 

AAB83334 

Ad JJO J J J4 

n uirica. n ^^r\ 

14 

379 

47 . 

6 

352 

4 

AAB8294 8 


nurrian niv 

15 

379 

47 . 

6 

352 

5 

TV TV TT H H 1 CO 

AAU9 / 152 

i-Va U j 1 J. 0 Z 

riuman \j p 

16 

379 

47 . 

6 

352 

5 

AAM52 82 9 

A-STTlS9 ft 9 Q 

/\amo Z O Z Z7 

nuruian 

17 

379 

47 . 

6 

352 

r 

5 

AAM5z oz o 

A^m c >9 ft 9 ft 

/\a.iiio z o z o 

Human C*C* 

n u.uici n ^ w 

18 

379 

47 . 

6 

352 

5 

ABG luo3 / 

aK n 7n SQ7 

Human ^ — Ti 

nuiua.ii \J p 

19 

379 

47 . 

6 

352 

5 

ABG9Z 8 83 

7\u«qoo p o 

HiiTnan n mm 

nurrian iiruii 

20 

379 

47 . 

6 

352 

5 

-r\ 7v T— i A r A 1 -1 

AAE25811 

/\a. cZ JOll 

Unman C* K> 

nurrian o p 

21 

379 

47 . 

6 

352 

r 

5 

ABBo 1Uo4 



22 

379 

47 . 

6 

352 

5 

ABBU o 343 


Unman cno 
nUllla.il one 

23 

379 

47 . 

6 

352 

6 

ABG75540 

ADg / o Dfi u 

numan u p 

24 

379 

47 . 

6 

352 

6 

ABR58602 

ADlJO OUZ 

numan can 

25 

379 

47 . 

6 

352 

6 

AA02 9514 

Aa O Z Z3 3 i. *i 

numan <w o 

26 

379 

47 . 

6 

352 

6 

i\ nTt/"1 /~C jl 

ABU61654 

ADUol DOfi 

numan \j p 

27 

379 

47 . 

6 

352 

6 

ABP9 / / 28 

/\Dpy / / Z 0 

Ammo aci 

28 

379 

47 . 

6 

352 

6 

ABP8 1933 

ADp OlJOJ 

Unman C C 

n umd n o o 

29 

379 

47 . 

6 

352 

7 

TV T~\ Z - ** A n O i 1 

ADC03341 


numan cne 

30 

379 

47 . 

6 

439 

2 

AAY41280 

Aayfl iz o u 

rusion pr 

31 

374 

46 . 

9 

371 

2 

AAW23834 

7A =»t tO "5 ft 1A 

AawZo oofi 

numan 

32 

373 

46 . 

8 

352 

2 

AAWO /60z 

AaWU /OUZ 

Unman CZ. to 

numa.il \j p 

33 

373 

46. 

8 

352 

3 

AAY8 0128 

ta^ttQHI Oft 

as y o u x z o 

nUman p 

34 

373 

46. 

8 

352 

4 

AAE07 046 

Aa. eu / U 4 o 

numan o p 

35 

373 

46 . 

8 

352 

4 

AAEO / 03 / 

7\ -,^07 0^7 

Unman t> 

nUIUdll o p 

36 

373 

46 . 

8 

352 

5 

AAU97150 


Ui ima n ni 

numan \j p 

37 

373 

46. 

8 

352 

5 

-»i -t-x j^t f\ r\ o o A"\ 

ABG92880 

Aogyz o o u 

numan o p 

38 

373 

46 . 

8 

352 

5 

AAE25808 

Aa e z o o u o 

nuiriari \j p 

39 

363 

45 . 

5 

352 

2 

AAW27125 

Aawz / IZ D 

lYiacaque o 

40 

363 

45 . 

r 

5 

3oz 

/ 

AULU33Dy 

AU. LU J JJ J 


41 

359 

45. 

0 

352 

4 

AAG79089 

Aag79089 

Amino aci 

42 

272 

34. 

1 

354 

2 

AAW54037 

Aaw54037 

Mouse CC- 

43 

258 

32. 

4 

184 

2 

AAW27406 

Aaw27406 

Inactive 

44 

258 

32. 

4 

215 

2 

AAW27408 

Aaw27408 

Inactive 

45 

258 

32. 

4 

215 

2 

AAW88238 

Aaw88238 

HIV-1 co- 


ALIGNMENTS 


RESULT 1 
ABG32539 

ID ABG32539 standard; protein; 138 AA. 
XX 

AC ABG32539; 
XX 

DT 15-NOV-2002 (first entry) 
XX 

DE Human CCR5-based scaffolded fusion protein #1. 
XX 

KW Scaffolded protein; CCR5; HIV; human immunodeficiency virus infection; 

KW ECD; extracellular domain; metal chelating motif; zinc finger protein; 

KW integral membrane protein; soluble loop; intracellular domain; ICD; 

KW gene therapy; immunogen; viral infection; human. 
XX 

OS Homo sapiens. 


OS Synthetic. 
XX 

PN WO200260477-A1. 
XX 

PD 08-AUG-2002. 
XX 

PF 29-JAN-2002; 2002WO-US002377 . 
XX 

PR 31-JAN-2001; 2001US-0265782P . 

PR 31-JAN-2001; 2001US-0265858P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Coleman TA, Mansfield B; 
XX 

DR WPI; 2002-643357/69. 
XX 

PT Novel scaffolded fusion polypeptide useful for therapeutic purposes or 

PT for screening molecules that bind/activate/inhibit/modulate the 

PT polypeptide, comprises a functional polypeptide domain fused to a 

PT scaffold domain. 

XX 

PS Example 1; Page 21; 64pp; English. 
XX 

CC The invention relates to a scaffolded fusion polypeptide comprising a 

CC functional polypeptide domain fused to a scaffold domain, where the 

CC functional polypeptide domain corresponds to a soluble loop of an 

CC integral membrane protein (e.g. human CCR5, a transmembrane receptor 

CC involved in HIV (human immunodeficiency virus) infection) . Also included 

CC are; (1) a polypeptide comprising a scaffold domain; (2) a nucleic acid 

CC encoding the fusion polypeptide; (3) a vector cassette for the expression 

CC of the fusion polypeptide comprising an expression region operably linked 

CC to a promoter, where the expression region comprises a number of 

CC cassettes, each of which encodes a module, domain or strand of the fusion 

CC polypeptide and (4) a host cell comprising the vector or nucleic acid. 

CC The fusion polypeptide is useful for screening molecules that 

CC bind/activate/inhibit/modulate the fusion polypeptide, by expressing the 

CC fusion polypeptide from and identifying a molecule that binds to the 

CC fusion polypeptide. The fusion polypeptide is useful in diagnostic 

CC methods, in assays to identify compounds that interact with loops of 

CC fragments of an extracellular domain (ECD) or an intracellular domain 

CC (ICD) or to rapidly assay the function of mutated portions of mutant 

CC integral membrane proteins without having to produce significant 

CC quantities of the entire mutant integral membrane protein, to generate 

CC antibodies that recognise the integral membrane proteins from which they 

CC are designed, to competitively bind the ligand of a naturally occurring 

CC receptor in vitro or in vivo, to display and/or screen soluble domains 

CC from protein such as integral membrane proteins, to probe the structure 

CC of ECD or ICD, or both, of an integral protein membrane, to modulate the 

CC activity of a receptor in vivo, and for treating or preventing viral 

CC infection, preferably human HIV infection e.g. by gene therapy using the 

CC encoding nucleic acid. The present sequence is a scaffolded protein based 

CC on the ECD region of human CCR5 (not defined) 

XX 

SQ Sequence 138 AA; 


Query Match 


100.0%; Score 797; DB 5; Length 138; 


Best Local Similarity 100.0%; Pred. No. 1.8e-75; 

Matches 138; Conservative 0; Mismatches 0; Indels 0; Gaps 


Qy 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60 



Db 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60 


Qy 


61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120 



Db 


61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 12 0 


QY 


121 NNCSSSNRLDGHQRVHAA 13 8 



Db 


121 NNCSSSNRLDGHQRVHAA 138 


RESULT 2 
ABG32540 

ID ABG32540 standard; protein; 157 AA. 
XX 

AC ABG32540; 
XX 

DT 15-NOV-2002 (first entry) 
XX 

DE Human CCR5-based scaffolded fusion protein #2. 
XX 

KW Scaffolded protein; CCR5; HIV; human immunodeficiency virus infection; 
KW ECD; extracellular domain; metal chelating motif; zinc finger protein; 
KW integral membrane protein; soluble loop; intracellular domain; ICD; 
KW gene therapy; immunogen; viral infection; human. 
XX 

OS Homo sapiens. 
OS Synthetic. 
XX 

FH Key Location/Qualifiers 
FT Peptide 1. .19 

FT /label= Signal_peptide 

FT Protein 20. .157 

FT /label= Mature_scaf f olded_protein 

XX 

PN WO200260477-A1. 
XX 

PD 08-AUG-2002. 
XX 

PF 29-JAN-2002; 2002WO-US002377 . 
XX 

PR 31-JAN-2001; 2001US-0265782P . 
PR 31-JAN-2001; 2001US-0265858P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Coleman TA, Mansfield B; 
XX 

DR WPI; 2002-643357/69. 
XX 

PT Novel scaffolded fusion polypeptide useful for therapeutic purposes or 
PT for screening molecules that bind/activate/inhibit/modulate the 


PT polypeptide, comprises a functional polypeptide domain fused to a 

PT scaffold domain* 

XX 

PS Example 2; Page 41; 64pp; English. 
XX 

CC The invention relates to a scaffolded fusion polypeptide comprising a 

CC functional polypeptide domain fused to a scaffold domain / where the 

CC functional polypeptide domain corresponds to a soluble loop of an 

CC integral membrane protein (e.g. human CCR5, a transmembrane receptor 

CC involved in HIV (human immunodeficiency virus) infection) . Also included 

CC are; (1) a polypeptide comprising a scaffold domain; (2) a nucleic acid 

CC encoding the fusion polypeptide; (3) a vector cassette for the expression 

CC of the fusion polypeptide comprising an expression region operably linked 

CC to a promoter, where the expression region comprises a number of 

CC cassettes, each of which encodes a module, domain or strand of the fusion 

CC polypeptide and (4) a host cell comprising the vector or nucleic acid. 

CC The fusion polypeptide is useful for screening molecules that 

CC bind/activate/inhibit/modulate the fusion polypeptide, by expressing the 

CC fusion polypeptide from and identifying a molecule that binds to the 

CC fusion polypeptide. The fusion polypeptide is useful in diagnostic 

CC methods, in assays to identify compounds that interact with loops of 

CC fragments of an extracellular domain (ECD) or an intracellular domain 

CC (ICD) or to rapidly assay the function of mutated portions of mutant 

CC integral membrane proteins without having to produce significant 

CC quantities of the entire mutant integral membrane protein, to generate 

CC antibodies that recognise the integral membrane proteins from which they 

CC are designed, to competitively bind the ligand of a naturally occurring 

CC receptor in vitro or in vivo, to display and/or screen soluble domains 

CC from protein such as integral membrane proteins, to probe the structure 

CC of ECD or ICD, or both, of an integral protein membrane, to modulate the 

CC activity of a receptor in vivo, and for treating or preventing viral 

CC infection, preferably human HIV infection e.g. by gene therapy using the 

CC encoding nucleic acid. The present sequence is a scaffolded protein based 

CC on the ECD region of human CCR5 (not defined) 

XX 

SQ Sequence 157 AA; 

Query Match 100.0%; Score 797; DB 5; Length 157; 

Best Local Similarity 100.0%; Pred. No. 2.1e-75; 

Matches 138; Conservative 0; Mismatches 0; Indels 0; Gaps 0 


QY 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60 



Db 


20 MDYQVS S P I YDINYYTSEPCQKINVKQI AAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 79 


QY 


Db 


61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120 

M | | I M I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I 
8 0 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 139 


QY 


121 NNCSSSNRLDGHQRVHAA 138 



Db 


140 NNCSSSNRLDGHQRVHAA 157 


RESULT 3 
AAW27407 

ID AAW27407 standard; protein; 352 AA. 


XX 

AC AAW27407; 
XX 

DT 14-APR-1998 (first entry) 
XX 

DE Human CCR5 . 
XX 

KW Human Cys-Cys chemokine receptor 5; CCR5; human immunodeficiency virus; 

KW type 1; type 2; HIV-1; HIV-2; diagnosis; treatment; prevention; 

KW inflammatory disease; rheumatoid arthritis; glomerulonephritis; asthma; 

KW idiopathic pulmonary fibrosis; psoriasis; viral infection; cancer; 

KW atherosclerosis; autoimmune disorder. 

XX 

OS Homo sapiens . 
XX 

PN WO9732019-A2. 
XX 

PD 04-SEP-1997. 
XX 

PF 28-FEB-1997; 97WO-BE000023 . 
XX 

PR 01-MAR-1996; 96EP-00870021 . 

PR 06-AUG-1996; 96EP-0 087 0102 . 
XX 

PA (EURO-) EUROS CREEN SA. 
XX 

PI Samson M, Parmentier M, Vassart G, Libert F; 
XX 

DR WPI; 1997-479829/44. 

DR N-PSDB; AAT90117. 
XX 

PT Active and inactive forms of human CC chemokine receptor CCR-5 - useful 

PT to diagnose, prevent and/or treat inflammatory disorders , autoimmune 

PT disease and viral infection. 
XX 

PS Claim 4; Fig lb-c; 94pp; English. 
XX 

CC The present sequence is human CC (Cys-Cys) chemokine receptor 5 (CCR5), 

CC which is stimulated by MIP-1 alpha, MIP-1 beta or RANTES chemokines, but 

CC not by monocyte chemoattractant protein 1 (MCP-1) , MCP-2, MCP-3, 

CC interleukin-8 (IL-8) or growth related gene product alpha (GRO alpha) 

CC chemokines. Active CCR-5 is also a receptor of human immunodeficiency 

CC virus type 1 or type 2 (HIV-1 or HIV-2). CCR5 or its cDNA can used to 

CC diagnose, treat and/or prevent inflammatory diseases, e.g. rheumatoid 

CC arthritis, glomerulonephritis, asthma, idiopathic pulmonary fibrosis and 

CC psoriasis, viral infections, especially HIV-1 or HIV-2 infection, cancer, 

CC atherosclerosis and autoimmune disorders 
XX 

SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 2; Length 352; 
Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVS S P I YDI N YYT S EP CQKI NVKQI AA 30 

I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I 

Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I I I I I I I I : I :: I 

Db 61 LKSMTDI YLLNLAI SDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFI GFFSGI FF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I II I I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

II I I I I II I I I I I II I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I II I I II II I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 4 
AAW27123 

ID AAW27123 standard; protein; 352 AA. 
XX 

AC AAW27123; 
XX 

DT 14-DEC-1997 (first entry) 
XX 

DE Human chemokine receptor 8 8C. 
XX 

KW Chemokine receptor 88C; atherosclerosis; rheumatoid arthritis; tumour; 
KW asthma; viral infection; AIDS; inflammation; autoimmune disease; therapy; 
KW diagnosis; leukocyte trafficking; G protein coupled receptor; ligand; 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
XX 

PN W09722698-A2. 
XX 

PD 26-JUN-1997. 
XX 


modulator; 

antibody; human. 


Homo sapiens. 



Key 


Location/ Qualifiers 


Domain 


1. .32 




/label= Extracellular 

domain 

Domain 


56. .67 




/label= Intracellular 

domain 

Domain 


8 9. .112 




/label= Extracellular 

domain 

Domain 


125. .145 




/label= Intracellular 

domain 

Domain 


166. .191 




/label= Extracellular 

domain 

Domain 


213. .235 




/label= Intracellular^ 

domain 

Domain 


259. .280 




/label= Extracellular^ 

domain 

Domain 


301. .352 




/label= Intracellular^ 

domain 


PF 20-DEC-1996; 96WO-US020759 . 
XX 

PR 20-DEC-1995; 95US-00575967 . 

PR 07-JUN-1996; 96US-00661393 . 
XX 

PA (ICOS-) ICOS CORP. 
XX 

PI Gray PW, Schweickart VL, Raport CJ; 
XX 

DR WPI; 1997-341689/31. 

DR N-PSDB; AAT85161. 
XX 

PT New nucleic acid encoding chemokine receptors 88-2B and 88C - used to 

PT modulate leukocyte trafficking, e.g. for treatment of inflammation, 

PT tumours, viral infections, auto-immune diseases, etc. 
XX 

PS Claim 16; Page 47-48; 65pp; English. 
XX 

CC This polypeptide sequence comprises novel human chemokine receptor 88C, a 

CC G protein coupled receptor that is involved in leukocyte trafficking. Its 

CC amino sequence was deduced from a cDNA clone (AAT85161) isolated from a 

CC macrophage library. It shows 62% identity to CCCKR1 . Chemokine receptor 

CC 88-2B (see AAW27124) has also been identified. 88C and 88-2B receptors 

CC and their polypeptide fragments can be produced in transformed host 

CC cells. The receptors, peptides comprising one or more of the 

CC extracellular or intracellular domains, and anti-receptor antibodies can 

CC be used to modulate receptor activities, particularly ligand and G 

CC protein binding, and are potentially potentially useful in the treatment 

CC of atheroslerosis, rheumatoid arthritis, tumours, asthma, viral 

CC infection, AIDS, inflammatory conditions, pathological immune response, 

CC abnormal haematopoietic processes etc 

XX 

SQ Sequence 352 AA; 


Query Match 47.6%; Score 37 9; DB 2; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I 1 I I I I I I I I I I I I II I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I II I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

Q y 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAWHAVFALKARTWFGVVTSVITWWAV 178 


Qy 7 9 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 5 
AAW23835 

ID AAW23835 standard; protein; 352 AA. 
XX 

AC AAW23835; 
XX 

DT 08-JUN-1998 (first entry) 
XX 

DE Human CC chemokine receptor 5 (CCR5) . 
XX 

KW CC chemokine receptor 5; CCR5; G-protein coupled receptor; 

KW human immunodeficiency virus; HIV; CD4; AIDS; therapy; transgenic animal. 
XX 

OS Homo sapiens. 


XX 

FH Key Location/Qualif iers 

FT Domain 29. .55 

FT /label= I 

FT /note= "transmembrane domain" 

FT Domain 104. .126 

FT /label= III 

FT /note= "transmembrane domain" 

FT Region 109. .120 

FT /note= "extracellular loop-1 (Claim 19)" 

FT Domain 143. .171 

FT /label= IV 

FT /note= "transmembrane domain" 

FT Region 187. .210 

FT /note= "extracellular loop-2 (Claim 19)" 

FT Domain 194. .219 

FT /label= V 

FT /note= "transmembrane domain" 

FT Domain 238. .258 

FT /label= VI 

FT /note= "transmembrane domain" 

FT Region 261. .276 

FT /note= "extracellular loop-3 (Claim 19)" 

FT Domain 277. .300 

FT /label= VII 

FT /note= "transmembrane domain" 

XX 


PN W09745543-A2 . 
XX 

PD 04-DEC-1997. 
XX 

PF 28-MAY-1997; 97WO-US009586 . 
XX 

PR 28-MAY-1996; 96US-0018508P . 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Combadiere C, Feng Y, Berger EA, Alkhatib G, Murphy PM; 

PI Broder CC, Kennedy PE; 

XX 

DR WPI; 1998-032650/03. 


DR N-PSDB; AAT76920. 
XX 

PT CC chemokine receptor 5 polypeptide - used to inhibit membrane fusion 

PT between HIV and a target cell. 

XX 

PS Claim 68; Fig 1C; 70pp; English. 
XX 

CC This protein sequence comprises of a novel human macrophage-selective CC 

CC chemokine receptor that has been designated CCR5 . The sequence was 

CC deduced from an isolated cDNA clone (see AAT76920) . An Alal27Leu variant 

CC (see W238340 of CCR5 was also identified. The susceptibility of human 

CC macrophages to HIV infection depends on cell surface expression of CD4 

CC and CCR5 . CCR5 is a member of the 7-transmembrane superfamily of G- 

CC protein coupled cell surface molecules. It plays an essential role in the 

CC membrane fusion step of infection by some HIV isolates. The establishment 

CC of stable, non-human cell lines and transgenic mammals having cells that 

CC coexpress human CD4 and CCR5 provides valuable tools for research of HIV 

CC infection. Antibodies that bind to CCR5, CCR5 variants, and CCR5-binding 

CC agents capable of blocking membrane fusion between HIV and target cells 

CC represent potential anti-HIV therapeutics for macrophage tropic strains 

CC of HIV 

XX 

SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 2; Length 352; ■ 

Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

II I I I I I I I I I I I i I I I I I I I I II I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDI YLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAWHAVFALKARTWFGWTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I II I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I II I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 6 
AAW88232 

ID AAW88232 standard; protein; 352 AA. 
XX 

AC AAW88232; 
XX 

DT 15-MAR-1999 (first entry) 
XX 


DE HIV-1 co-receptor CCR5 . 
XX 

KW HIV-1; CCR5; CCR5m303; co-receptor; infection; diagnosis; AIDS; 

KW gene therapy; human. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Domain 32. .56 

FT /note= "transmembrane domain 1" 

FT Domain 67. .87 

FT /note= "transmembrane domain 2" 

FT Misc-dif ference 101 

FT /note= "corresponds to TGT (Cys) in wild-type CCR5, TGA 

FT (Stop) in CCr5m303" 

FT Domain 103. .124 

FT /note= "transmembrane domain 3" 

FT Domain 142. .167 

FT /note= "transmembrane domain 4" 

FT Domain 200. .223 

FT /note= "transmembrane domain 5" 

FT Domain 236. .260 

FT /note= "transmembrane domain 6" 

FT Domain 275. .301 

FT /note= "transmembrane domain 7" 
XX 

PN W09854317-A1. 
XX 

PD 03-DEC-1998. 
XX 

PF 29-MAY-1998; 98WO-EP003437 . 
XX 

PR 30-MAY-1997; 97US-0048057P . 
XX 

PA (MOND-) FOND MONDIALE RECH & PREVENTION SIDA. 
XX 

PI Beretta A, Quillent C, Arenzana Siesdedos F, Braun J; 
XX 

DR WPI; 1999-059835/05. 

DR N-PSDB; AAV84126. 
XX 

PT New CCR5 variant protein of the HIV-1 co-receptor - useful in developing 

PT resistance of CCR5-expressing cells to HIV-1 infection. 

XX 

PS Disclosure; Page 34-35; 55pp; English. 
XX 

CC This is the amino acid sequence of wild-type human CCR5, which serves as 

CC a co-receptor for infection by macrophage-tropic (M-tropic) strains of 

CC HIV-1. The invention relates to the identification of a CCR5 variant (see 

CC AAW88231), designated CCR5m303, comprising the first two transmembrane 

CC domains of wild-type CCR5 , but lacking transmembrane domains 3-7. The 

CC presence of the CCR5m303 variant with the wild type CCR5 allele shows a 

CC positive correlation with resistance to infection with M-tropic HIV-1 

CC strains , and may indicate slower progression of the disease. The 

CC detection of CCR5 variants may be used to identify individuals at lower 

CC risk of infection relative to the general population who, if infected, 

CC may exhibit slower progression to AIDS. Probes and primers (see AAV84127- 


CC 36) are provided for use in diagnostic methods for detecting the presence 

CC of such variants. A method is provided for inhibiting HIV-1 infection of 

CC a cell expressing the CCR5 receptor. This involves introducing a nucleic 

CC acid encoding a CCR5 variant into the cell, thereby reducing the number 

CC of functional CCR5 molecules present on the cell surface 
XX 

SQ Sequence 352 AA; 


Query Match 47.6%; Score 379; DB 2; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHS YKCG- — 64 

I I I II I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAVVHAVFALKARTWFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 17 9 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 7 
AAE07048 

ID AAE07048 standard; protein; 352 AA. 
XX 

AC AAE07048; 
XX 

DT 16-OCT-2001 (first entry) 
XX 

DE Human G-protein chemokine receptor (CCR5) HDGNR10 protein #2. 
XX 

KW Human; G-protein chemokine receptor; CCR5; HDGNR10; inflammation; HIV; 

KW human immunodeficiency virus; antimicrobial; vasodilator; vulnerary; 

KW cytostatic; immunosuppressive; nootropic; neuroprotective; gene therapy; 

KW neurodegenerative disorder; Kaposi's sarcoma; autoimmune disease; 

KW rheumatoid arthritis; cancer; breast; ovary; adrenal gland; bone marrow; 

KW gastrointestinal tract; lung; liver; immune disorder; Addison's disease; 

KW haemolytic anaemia; autoimmune thyroiditis; diabetes mellitus; allergy; 

KW multiple sclerosis; ulcerative colitis; Crohn's disease; wound healing; 

KW cardiovascular disorder; myocardial ischaemia. 

XX 

OS Homo sapiens . 
XX 

PN WO200158916-A2. 
XX 


PD 16-AUG-2001. 
XX 

PF 09-FEB-2001; 2001WO-US004153 . 
XX 

PR 09-FEB-2000; 2000US-0181258P . 

PR 09-MAR-2000; 2000US-0187999P . 

PR 22-SEP-2000; 2 0O0US-O234336P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Roschke V, Li Y, Ruben SM; 
XX 

DR WPI; 2001-488966/53. 

DR N-PSDB; AAD13299. 
XX 

PT Isolated nucleic acid encoding a human G-protein chemokine receptor 

PT (CCR5) HDGNR10 polypeptide, useful for preventing or treating autoimmune 

PT diseases e.g. rheumatoid arthritis, hyperprolif erative disorders and 

PT neurodegenerative disorders. 

XX 

PS Example 40; Page 504-505; 518pp; English. 
XX 

CC The invention relates to human G-protein chemokine receptor (CCR5) 

CC HDGNR10 polypeptides and polynucleotides. CCR5 HDGNR10 antibodies are 

CC useful for treating, preventing or ameliorating a disease or disorder 

CC associated with inflammation, defective or aberrant chemotaxis of immune 

CC cells, HIV infection (such as Pneumocystis carinii pneumonia or Kaposi's 

CC sarcoma) or defective or aberrant T-cell antigen presenting cell 

CC interaction. The disease or disorder may also be an infectious disease 

CC (e.g. a viral infection such as an early stage HIV infection, a 

CC cytomegalovirus infection, or a poxvirus infection) , an autoimmune 

CC disease (e.g. rheumatoid arthritis) or a neurodegenerative disorder. The 

CC disease or disorder may be associated with aberrant CCR5 expression, lack 

CC of CCR5 function, aberrant CCR5 ligand expression, or lack of CCR5 ligand 

CC function. CCR5 HDGNR10 protein is used as a food additive or preservative 

CC to increase or decrease storage capabilities. CCR5 HDGNR10 DNA are useful 

CC for chromosome identification and in gene therapy. CCR5 HDGNR10 DNA, 

CC protein, antibodies, agonists and antagonists are also useful in the 

CC diagnosis, treatment and prevention of cancer (breast, ovary, adrenal 

CC gland, bone, bone marrow, gastrointestinal tract, liver, lung, 

CC urogenital); immune disorders (Addison's disease, allergies, autoimmune 

CC haemolytic anaemia, autoimmune thyroiditis, diabetes mellitus, Crohn's 

CC disease, multiple sclerosis, rheumatoid arthritis and ulcerative colitis) 

CC ; cardiovascular disorders (myocardial ischaemias) and wound healing. The 

CC present sequence is human CCR5 HDGNR10 protein 

XX 

SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 4; Length 352; 
Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVS S P I YDINYYT SEP CQKINVKQI AA 30 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 


31 


YKCGLC' 


AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 


I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 1 1 L LT I D R YLAWHAVFALKART VT FGWT S VI TWWAVFAS L P GI I FT RS QKE GLH YT C 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I II I I II I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I II I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 8 
AAG80111 

ID AAG80111 standard; protein; 352 AA. 
XX 

AC AAG80111; 
XX 

DT 17-JAN-2002 (first entry) 
XX 

DE Human CCR5 protein. 
XX 

KW Chemokine; tumour diagnosis; colorectal; prostatic; organ rejection; 

KW inflammation; autoimmune disease; metastasis; bronchial asthma; lupus; 

KW chronic bowel inflammation; rheumatoid arthritis; cytostatic; 

KW antiinflammatory; antiasthmatic; immunosuppressive; dermatological ; 

KW antirheumatic; antiarthritic . 

XX 

OS Homo sapiens. 
XX 

PN WO200172830-A2. 
XX 

PD 04-OCT-2001. 
XX 

PF 02-APR-2001; 2001WO-EP0037 08 . 
XX 

PR 31-MAR-2000; 2000DE-01016013 . 
XX 

PA (IPFP-) IPF PHARM GMBH. 

PA (FORS/) FORSSMANN U. 
XX 

PI Forssmann W, Adermann K, Heitland A, Spodsberg N; 
XX 

DR WPI; 2001-626256/72. 
XX 

PT Diagnostic agent containing two or more receptor-specific ligands, useful 

PT for detecting tumors, inflammation etc., also therapeutic use of ligand 

PT inhibitors. 
XX 

PS Disclosure; Page 10; 26pp; German. 
XX 

CC This invention describes a novel diagnostic agent (A) comprising at least 

CC two different ligands (I) for receptors (II) that are implicated in 


CC disease. (A) are used for the diagnosis of tumors (especially colorectal 

CC or prostatic) , organ rejection, inflammation and autoimmune diseases. 

CC Also inhibitors of (I) are used therapeutically against tumors (and their 

CC metastases), inflammation (particularly bronchial asthma or chronic bowel 

CC inflammation), or autoimmune diseases (rheumatoid arthritis or lupus), 

CC where the (cardio) vascular, lymphatic, respiratory, nervous, digestive, 

CC endocrine, motor or urogenital systems or skin are affected, and bone 

CC marrow diseases. The products of the invention are chemokine derivatives 

CC which have cytostatic, antiinflammatory, antiasthmatic, 

CC immunosuppressive, dermatological , antirheumatic, antiarthritic . 

CC Chemokines act on specific tumor and inflammatory cells through a 

CC constellation of chemokine receptors (CR) , which control migration and 

CC proliferation of these cells. AAG80045-AAG8 012 8 represent human chemokine 

CC fragments used to illustrate the method of the invention 

XX 

SQ Sequence 352 AA; 


Query Match 47.6%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MD YQVS S P I YDIN YYT S EPCQKI NVKQI AA 30 

I I I I I I I I I I! I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAWHAVFALKARTWFGWTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 7 9 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I II I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 9 
AAE04321 

ID AAE04321 standard; protein; 352 AA. 
XX 

AC AAE04321; 
XX 

DT 04-SEP-2001 (first entry) 
XX 

DE Human chemokine receptor (CKR) , CC-CKR-5 related protein #2. 
XX 

KW Human; transformed mammalian cell; CD4 ; reporter gene; translocation; 

KW human immuno deficiency virus; HIV; long terminal repeat; LTR; therapy; 

KW chemokine receptor; CKR; cellular dysfunction; HIV infection; cofactor; 

KW CC-CKR-5; envelope glycoprotein; anti-HIV. 
XX 


OS Homo sapiens. 
XX 

PN US6258527-B1. 
XX 

PD 10-JUL-2001. 
XX 

PF 21-MAY-1997; 97US-00861105 . 
XX 

PR 20-MAY-1996; 96US-0017157P . 

PR 19-JUN-1996; 96US-002 0043P . 

PR 19-MAY-1997; 97US-00858660 . 
XX 

PA (AARO-) AARON DIAMOND AIDS RES CENT. 

PA (UYNY ) UNIV NEW YORK STATE. 

XX 

PI Littman DR, Deng H, Ellmeier W, Landau NR, Liu R; 
XX 

DR WPI; 2001-417127/44. 

DR N-PSDB; AAD08577. 
XX 

PT Transformed mammalian cell (I) that contains a CD4 gene, reporter gene 

PT and HIV LTR for identification of drugs and antibodies for treatment of 

PT HIV. 
XX 

PS Disclosure; Col 47-50; 37pp; English. 
XX 

CC The present invention relates to a transformed mammalian cell that 

CC contains a gene encoding CD4, a construct encoding a reporter gene under 

CC the regulation of an human immuno deficiency virus (HIV) long terminal 

CC repeat (LTR) and that has been transduced with a vector encoding a human 

CC chemokine receptor (CKR) where the CD4 and the CKR are present on the 

CC cell surface of transformed mammalian cell. The invention is useful for 

CC identifying drugs or antibodies that interfere with the translocation of 

CC HIV into transformed mammalian cell or for identifying a human chemokine 

CC receptor that facilitates the infection of a particular HIV strain into 

CC the transformed mammalian cell. Compounds identified can be used to treat 

CC cellular dysfunction and to prevent or combat HIV infection. The present 

CC sequence is a human chemokine receptor (CKR) , CC-CKR-5 related protein. 

CC CC-CKR-5 is the principal cofactor for entry mediated by the envelope 

CC glycoproteins of primary macrophage-tropic strains of HIV-1 
XX 

SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 4; Length 352; 
Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 65 


LCTRSQKEGLHYTC 7 8 
: I I I I I I I I I I I I 


Db 


119 IILLTIDRY^VVHAVFALKARTVTFGVWSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 


Qy 7 9 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I II II I I I I I I I I I I I I : : I 

Db 17 9 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 10 
AAE07039 

ID AAE07039 standard; protein; 352 AA. 
XX 

AC AAE07039; 
XX 

DT 16-OCT-2001 (first entry) 
XX 

DE Human G-protein chemokine receptor (CCR5) HDGNR10 protein #2. 
XX 

KW Human; G-protein chemokine receptor; CCR5; HDGNR10; inflammation; HIV; 

KW human immunodeficiency virus; antimicrobial; vasodilator; vulnerary; 

KW cytostatic; immunosuppressive; nootropic; neuroprotective; gene therapy; 

KW neurodegenerative disorder; Kaposi's sarcoma; autoimmune disease; 

KW rheumatoid arthritis; cancer; breast; ovary; adrenal gland; bone marrow; 

KW gastrointestinal tract; lung; liver; immune disorder; Addison's disease; 

KW haemolytic anaemia; autoimmune thyroiditis; diabetes mellitus; allergy; 

KW multiple sclerosis; ulcerative colitis; Crohn's disease; wound healing; 

KW cardiovascular disorder; myocardial ischaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200158915-A2. 
XX 

PD 16-AUG-2001. 
XX 

PF 09-FEB-2001; 2001WO-US004 152 . 
XX 

PR 09-FEB-2000; 2000US-018 1258P . 

PR 09-MAR-2000; 2000US-0187999P . 

PR 22-SEP-2000; 2000US-0234336P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Roschke V, Li Y, Ruben SM; 
XX 

DR WPI; 2001-488965/53. 

DR N-PSDB; AAD13198. 
XX 

PT Isolated nucleic acid encoding a human G-protein chemokine receptor 

PT (CCR5) HDGNR10 polypeptide, useful for preventing or treating autoimmune 

PT diseases e.g. rheumatoid arthritis, hyperprolif erative disorders and 

PT neurodegenerative disorders. 

XX 

PS Example 40; Page 486-487; 495pp; English. 
XX 


CC The present sequence is human G-protein chemokine receptor (CCR5) HDGNR10 

CC protein. CCR5 HDGNR10 antibodies are useful for treating, preventing or 

CC ameliorating a disease or disorder associated with inflammation, 

CC defective or aberrant chemotaxis of immune cells, HIV infection (such as 

CC Pneumocystis carinii pneumonia or Kaposi's sarcoma) or defective or 

CC aberrant T-cell antigen presenting cell interaction. The disease or 

CC disorder may also be an infectious disease (e.g. a viral infection such 

CC as an early stage HIV infection, a cytomegalovirus infection, or a 

CC poxvirus infection), an autoimmune disease (e.g. rheumatoid arthritis) or 

CC a neurodegenerative disorder. The disease or disorder may be associated 

CC with aberrant CCR5 expression, lack of CCR5 function, aberrant CCR5 

CC ligand expression, or lack of CCR5 ligand function. CCR5 HDGNR10 protein 

CC is used as a food additive or preservative to increase or decrease 

CC storage capabilities. CCR5 HDGNR10 DNA are useful for chromosome 

CC identification and in gene therapy. CCR5 HDGNR10 DNA, protein, 

CC antibodies, agonists and antagonists are also useful in the diagnosis , 

CC treatment and prevention of cancer (breast, ovary, adrenal gland, bone, 

CC bone marrow, gastrointestinal tract, liver, lung, urogenital); immune 

CC disorders (Addison's disease, allergies, autoimmune haemolytic anaemia, 

CC autoimmune thyroiditis, diabetes mellitus, Crohn's disease, multiple 

CC sclerosis, rheumatoid arthritis and ulcerative colitis); cardiovascular 

CC disorders (myocardial ischaemias) and wound healing 

XX 

SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

MDYQVS S PI YDINYYTS EPCQKINVKQIAARLLPPLYSLVFI FGFVGNMLVI LI LINCKR 60 


Qy 

1 

Db 

1 

Qy 

31 

Db 

61 

Qy 

65 

Db 

119 

Qy 

79 

Db 

179 

Qy 

106 

Db 

239 


-YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I II I I I I I I : I : : I 


-LCTRSQKEGLHYTC 7 8 
: I I I I I I I I I I I I 


SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : | 


-GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I II I I I I II I I I I I I I : I 


RESULT 11 
AAB46858 

ID AAB46858 standard; protein; 352 AA. 
XX 

AC AAB46858; 
XX 

DT 16-AUG-2001 (revised) 


DT 02-AUG-2001 (revised) 

DT 04-MAY-2001 (first entry) 

XX 

DE Human HDGNR10 protein. 
XX 

KW HDGNR10; human; G-protein chemokine receptor; antiinflammatory; 

KW immunomodulatory; anticoagulant; antiallergic; immunosuppressive; 

KW cytostatic; antiparasitic; antipsoriatic; antirheumatic; antiarthritic; 

KW vasotropic; gene therapy; haematopoiesis ; wound healing; coagulation; 

KW angiogenesis; solid tumour; infection; leukemia; growth factor activity; 

KW T-cell mediated autoimmune disease; psoriasis; allergy; atherogenesis; 

KW anaphylaxis; malignancy; inflammation; histamine; IgE; silicosis; shock; 

KW immunoglobulin E-mediated allergic reaction; rheumatoid arthritis; 

KW prostaglandin-independent fever; bone marrow failure; sarcoidosis; 

KW hyper-eosinophilic syndrome; vulnerary. 

XX 

OS Homo sapiens . 
XX 

PN US2001000241-A1. 
XX 

PD 12-APR-2001. 
XX 

PF 29-NOV-2000; 2000US-00725285 . 
XX 

PR 06-JUN-1995; 95US-00466343 . 

PR 18-NOV-1998; 98US-00195662 . 

PR 25-JUN-1999; 99US-00339912 . 
XX 

PA (LIYY/) LI Y. 

PA (RUBE/) RUBEN S M. 

XX 

PI Li Y, Ruben SM; 
XX 

DR WPI; 2001-226317/23. 

DR N-PSDB; AAF26390. 
XX 

PT New human G-protein chemokine receptor polypeptides and polynucleotides, 

PT useful for identifying (ant ) agonists to the G-protein chemokine receptor. 
XX 

PS Claim la; Page 15; 22pp; English. 
XX 

CC This invention describes a novel receptor polypeptide (I) selected from 

CC (i) a fully defined 329 amino acid sequence (II) fully disclosed in the 

CC specification; and (ii) a polypeptide encoded by the cDNA contained in a 

CC plasmid, and fragments, analogs and derivatives of the polypeptide. The 

CC products of the invention have antiinflammatory, immunomodulatory, 

CC anticoagulant, antiallergic, immunosuppressive, vulnerary, cytostatic, 

CC antiparasitic, antipsoriatic, antirheumatic, antiarthritic and vasotropic 

CC activity and can be used for gene therapy. The G-protein chemokine 

CC receptors, HDGNR10, (I) are useful for screening for compounds which 

CC activate or inhibit activation of (I) . The products of the invention can 

CC also be used for stimulating haematopoiesis, wound healing, coagulation, 

CC angiogenesis, treating solid tumours, chronic infections, leukemia, T- 

CC cell mediated autoimmune diseases, parasitic infections, psoriasis, and 

CC stimulating growth factor activity. HDGNR10 is useful for treating 

CC allergy, atherogenesis, anaphylaxis, malignancy, chronic and acute 

CC inflammation, histamine and immunoglobulin E ( IgE) -mediated allergic 


CC reactions, prostaglandin-independent fever , bone marrow failure, 

CC silicosis, sarcoidosis, rheumatoid arthritis, shock and hyper- 

CC eosinophilic syndrome. (N.B. This record was resubmitted to correct 

CC errors in the keyword formatting) 

XX 

SQ Sequence 352 AA; 

Query Match 47.6%; Score 37 9; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 lILLTIDRYLAvA^VFALKARTVTFGvA^SVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 12 
ABB56342 

ID ABB56342 standard; protein; 352 AA. 
XX 

AC ABB56342; 
XX 

DT 18-FEB-2002 (first entry) 
XX 

DE Non-endogenous human GPCR protein, SEQ ID NO: 477. 
XX 

KW Human; G protein-coupled receptor; GPCR; non-endogenous; mutant; 

KW constitutively activated GPCR; agonist; disease. 

XX 

OS Homo sapiens. 

OS Synthetic. 
XX 

PN WO200177172-A2. 
XX 

PD 18-OCT-2001. 
XX 

PF 05-APR-2001; 2001WO-US011098 . 
XX 

PR 07-APR-2000; 2 OOOUS-O 195747P . 
XX 

PA (AREN-) ARENA PHARM INC. 


XX 

PI Lehmann-Bruinsma K, Liaw CW, Lin I; 
XX 

DR WPI; 2001-648759/74. 

DR N-PSDB; ABI97978. 
XX 

PT Identifying agonists of G protein-coupled receptors (GPCRs) for use in 

PT disease treatment, comprises contacting candidate compounds with versions 

PT of GPCRs. 
XX 

PS Claim 1; Page 277-278; 394pp; English. 
XX 

CC The invention relates to G protein-coupled receptors (GPCRs) for which 

CC the endogenous ligand has been identified. Non-endogenous constitutively 

CC activated versions of known GPCRs are used in the invention for the 

CC direct identification of candidate compounds as receptor agonists, 

CC inverse agonists or partial agonists. Such agonists are useful as 

CC therapeutic agents for diseases or disorders associated with GPCRs. The 

CC present sequence is a non-endogenous version of a known human GPCR 

XX 

SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I II I I I I I I I I I I I I I I II I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

QY 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAVVliAVFALKARTWFGVVrSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I II I I I I I I I I I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAKRLIF 238 

Qy 106 GGS YKCGLC- - — QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 13 
AAB83354 

ID AAB83354 standard; protein; 352 AA. 
XX 

AC AAB83354; 
XX 

DT 09-OCT-2001 (first entry) 
XX 

DE Human CCR5 protein sequence. 
XX 


KW Chemotactic chemokine receptor 5; gpl20; CD 4 ; therapy; HIV; CCR5; 

KW human immunodeficiency virus; anti-inflammatory disease; human. 
XX 

OS Homo sapiens. 
XX 

PN EP1118858-A2 . 
XX 

PD 25-JUL-2001. 
XX 

PF 03-JAN-2001; 2001EP-00300020 . 
XX 

PR 12-JAN-2000; 2000GB-00000659 . 

PR 12-JAN-2000; 2000GB-00000661 . 

PR 12-JAN-2000; 2000GB-00000663 . 
XX 

PA (PFIZ ) PFIZER LTD. 

PA (PFIZ ) PFIZER INC. 
XX 

PI Dobbs S, Perros M, Rickett GA; 
XX 

DR WPI; 2001-477088/52. 

DR N-PSDB; AAF87099. 
XX 

PT Determining if an agent can modulate CCR5-gpl20 interaction, comprises 

PT incubating the agent with CCR5 and gpl20 and determining if the agent 

PT modulates the interaction. 
XX 

PS Claim 1; Page 110; 113pp; English. 
XX 

CC This sequence represents the human CCR5 protein sequence. The invention 

CC relates to a method for determining whether an agent is capable of 

CC modulating the interaction of chemotactic chemokine receptor 5 (CCR5) 

CC with gpl20, comprising incubating the agent with CCR5 and gpl20 and 

CC determining whether the agent modulates the interaction, where gpl20 is 

CC associated with CD4 , and where the interaction is a low affinity binding. 

CC The method is used to identify an agent capable of modulating the 

CC interaction of CCR5 with gpl20. An agent identified by the method is used 

CC to prepare a pharmaceutical composition for the treatment of a disease or 

CC condition associated with CCR5 and gpl20 interaction, to treat a subject 

CC with a disease or condition associated with CCR5 and gpl20 interaction, 

CC and for preparing a pharmaceutical for treating human immunodeficiency 

CC virus (HIV). It can also be treat anti-inflammatory diseases. The method 

CC is commercially useful, amenable to high throughput screening, and 

CC detects interaction of gpl20 with cells expressing only CCR5 

XX 

SQ Sequence 352 AA; 

Query Match 47.6%; Score 379; DB 4; Length 352; 
Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I I I I I I I I : I :: I 


Db 


61 LKSMTDI YLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 


Qy 


6 5 LCTRSQKEGLHYTC 78 



Db 


119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 


Qy 


Db 



Qy 


106 GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 135 



Db 


239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 14 
AAB82948 

ID AAB82948 standard; protein; 352 AA. 
XX 

AC AAB82948; 
XX 

DT 21-DEC-2001 (first entry) 
XX 

DE Human HIV-1 co-receptor CCR5. 
XX 

KW CCR5; chemokine; co-receptor; human immunodeficiency virus type 1; HIV-1; 

KW infection; therapy; vaccine; anti-HIV-1. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Binding-site 2. .18 

FT /note= "binds to HIV-1 gpl20" 
XX 

PN WO200164710-A2. 
XX 

PD 07-SEP-2001. 
XX 

PF 28-FEB-2001; 2001WO-US006699 . 
XX 

PR 29-FEB-2000; 2000US-0185667P . 

PR 19-MAY-2000; 2000US-0205839P . 

PR 07-FEB-2001; 2 001US- 0267231P . 
XX 

PA (PROG-) PROGENICS PHARM INC. 

PA (AARO-) AARON DIAMOND AIDS RES CENT. 

XX 

PI Dragic T, Olson WC; 
XX 

DR WPI; 2001-611273/70. 

DR N-PSDB; AAH26903. 
XX 

PT Novel compounds comprising specific amino acids within CCR5 (HIV 1 co- 

PT receptor) amino terminal domain including negatively charged and two 

PT sulfated tyrosine residues is useful for treating HIV infection in 

PT humans . 
XX 


PS Claim 1; Page 30; 163pp; English. 
XX 

CC The present sequence is that of human HIV-1 co-receptor CCR5 . Amino acids 

CC 2-18 in the N-terminal region of CCR5 comprise an HIV-1 gpl20-binding 

CC site that determines the specificity of the interaction between CCR5 and 

CC HIV-1 gpl20. Post-translational sulfation of the tyrosine residues in the 

CC CCR5 N-terminus is required for gpl20 binding and may critically modulate 

CC the susceptibility of target cells to HIV-1 infection in vivo. The 

CC invention provides claimed sulfated peptides ( see AAB82947) that are 

CC based on the CCR5 N-terminal region and which are effective for 

CC inhibiting HIV-1 binding to CCR5 . These peptides are used in claimed 

CC methods of inhibiting HIV infection of CD4+ cells , of preventing CD4+ 

CC cells from becoming infected with HIV, of treating a subject whose CD4+ 

CC cells are infected with HIV, and of identifying an agent which inhibits 

CC binding of a CCR5 ligand to a CCR5 receptor. The methods may be carried 

CC out in a subject, especially a human, infected (therapeutic method), not 

CC infected with HIV (prophylactic method) , or in a subject who is not 

CC infected with, but has been exposed to, HIV 
XX 

SQ Sequence 352 AA; 


Query Match 47.6%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.2e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVS S P I YDINYYT S EPCQKI NVKQIAA 30 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

QY 31 YKCGLC AAAQWD FGNTMCQHQRVHGHHHHS YKCG 64 

I I 1111111111111:1: : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGI FF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAWHAVFALKAJ^TVTFGVA^SVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I II I I I I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

QY 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 15 
AAU97152 

ID AAU97152 standard; protein; 352 AA. 
XX 

AC AAU97152; 
XX 

DT 13-AUG-2002 (first entry) 
XX 

DE Human G-protein chemokine receptor (CCR5) HDGNR10 #2. 
XX 

KW Human; G-protein chemokine receptor; CCR5; HDGNR10; inflammation; 


KW immune cell chemotaxis; autoimmune disease; rheumatoid arthritis; 

KW neurodegeneration; viral infection; Kaposi sarcoma; cancer; 

KW hyperprolif erative disease; neurological disease; receptor. 
XX 

OS Homo sapiens. 
XX 

PN US2002048786-A1. 
XX 

PD 25-APR-2002. 
XX 

PF 09-FEB-2001; 2001US-00779879 . 
XX 

PR 09-FEB-2000; 2000US-0181258P . 

PR 09-MAR-2000; 2000US-0187999P . 

PR 22-SEP-2000; 2000US-0234336P . 
XX 

PA (ROSE/) ROSEN C A. 

PA (ROSC/) ROSCHKE V. 

PA (LIYY/) LI Y. 

PA (RUBE/) RUBEN S M. 

XX 

PI Rosen CA, Roschke V, Li Y, Ruben SM; 
XX 

DR WPI; 2002-434754/46. 

DR N-PSDB; ABK51870. 
XX 

PT New nucleic acid encoding an antibody specific for the G-protein 

PT chemokine receptor CCR5 , useful for treatment and diagnosis of e.g. 

PT inflammation. 
XX 

PS Disclosure; Page 165-166; 180pp; English. 
XX 

CC The present invention relates to the isolation of a novel human G-protein 

CC chemokine receptor (CCR5) designated HDGNR10, and polynucleotide 

CC sequences encoding it. The invention also describes antibodies that bind 

CC human G-protein chemokine receptor (CCR5) HDGNR10 and polynucleotide 

CC sequences encoding the antibodies. The antibodies are useful for treating 

CC or preventing inf lamination, defective or aberrant chemotaxis of immune 

CC cells and T-cell/antigen-presenting cell interactions, infections and 

CC autoimmune diseases, rheumatoid arthritis, neurodegeneration, viral 

CC infections (especially early-stage human immune deficiency virus (HIV) , 

CC cytomegalovirus or pox virus infections, Kaposi sarcoma, or conditions 

CC associated with aberrant or deficient expression of the CCR5 receptor or 

CC its ligands. The antibodies are also useful to determine CCR5 expression, 

CC e.g. for diagnosis, prognosis and monitoring of cancer and other 

CC hyperprolif erative diseases. The polynucleotide sequences encoding human 

CC G-protein chemokine receptor (CCR5) HDGNR10 can be used to produce the 

CC recombinant receptor, and in the treatment of a wide range of diseases 

CC such as infectious diseases (e.g. influenza), neurological diseases (e.g. 

CC Parkinson's disease), and hyperprolif erative disorders (e.g. cancer). The 

CC present sequence represents human G-protein chemokine receptor (CCR5) 

CC HDGNR10 #2 

XX 

SQ Sequence 352 AA; 


Query Match 47.6%; Score 379; DB 5; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.2e-31; 


Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 1 1 LLTI DRYLAVVHAVFALKARTWFGVVTSVITWWAVFASLPGI I FTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I II I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


Search completed: March 5, 2004, 16:22:47 
Job time : 139 sees 


GenCore version 5.1,6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 


Run on: March 5, 2004, 16:17:14 ; Search time 37.0556 Seconds 

(without alignments ) 
192.262 Million cell updates/sec 

Title: US-10-057-8 90A-10 

Perfect score: 797 

Sequence: 1 MDYQVSSPIYDINYYTSEPC GLNNCS S SNRLDGHQRVHAA 138 


Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 389414 seqs, 51625971 residues 


Total number of hits satisfying chosen parameters: 389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/2/iaa/5A_COMB . pep : * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/2/iaa/6A_COMB . pep : * 
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5: /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep: * 

6: /cgn2_6/ptodata/2/iaa/backfilesl .pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 


RESULT 1 

US-09-087-232A-13 

; Sequence 13, Application US/09087232A 

; Patent No. 6153431 

; GENERAL INFORMATION: 

; APPLICANT: Quillent et al . 

TITLE OF INVENTION: HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR 

TITLE OF INVENTION: VARIANTS ASSOCIATED WITH RESISTANCE TO VIRUS 
INFECTION. 

NUMBER OF SEQUENCES: 23 

CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Baker & Botts, L.L.P. attn. Lisa Kole 

STREET: 30 Rockefeller Plaza 
CITY: New York 
; STATE: New York 

COUNTRY: USA 
; ZIP: 10112 

COMPUTER READABLE FORM: 


MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/09/ 08 7 , 232A 

FILING DATE: 2 8 MAY 1998 
CLASSIFICATION: 4 35 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/048,057 
FILING DATE: 30 MAY 1997 
ATTORNEY/AGENT INFORMATION: 
; NAME: KOLE, LISA B. 

; REGISTRATION NUMBER: 35 , 225 

REFERENCE/ DOCKET NUMBER: AP 31115 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 408-2628 
TELEFAX: (212) 765-2519 
; INFORMATION FOR SEQ ID NO: 13: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 352 amino acids 

; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-087-232A-13 

Query Match 47.6%; Score 37 9; DB 3; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II II I I II I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I II I I I I 

Db 119 IILLTIDRYIAVWAVFALKARTWFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 7 9 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

II I II I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I II I I I I I I II I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 2 

US-08-861-105-14 

; Sequence 14, Application US/08861105 

; Patent No. 6258527 

; GENERAL INFORMATION: 

APPLICANT: LITTMAN, DAN R. 


APPLICANT: DENG, HONGKUI 
APPLICANT: ELLMEIER, WILFRIED 
APPLICANT : LANDAU, NATHANIEL R. 
APPLICANT: LIU, RONG 

TITLE OF INVENTION: G-COUPLED RECEPTORS ASSOCIATED WITH 

TITLE OF INVENTION: MACROPHAGE- TROPHIC HIV, AND DIAGNOSTIC AND THERAPEUTIC 
TITLE OF INVENTION: USES THEREOF 
NUMBER OF SEQUENCES: 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: David A. Jackson, Esq. 

STREET: 411 Hackensack Ave, Continental Plaza, 4th 
STREET: Floor 
CITY: Hackensack 
STATE: New Jersey 
COUNTRY: USA 
ZIP: 07601 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/861, 105 
FILING DATE: 
CLASSIFICATION: 436 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/666,020 
FILING DATE: 19-JUN-1996 
CLASSIFICATION: 436 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/227,319 
FILING DATE: 13-APR-1994 
ATTORNEY/AGENT INFORMATION: 

NAME: Jackson Esq., David A. 
REGISTRATION NUMBER: 26,742 
REFERENCE/ DOCKET NUMBER: 1049-1-004 Nl 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 201-4 87-5800 
TELEFAX: 201-34 3-1684 
INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 352 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
HYPOTHETICAL: NO 
ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 
US-08-861-105-14 

Query Match 47.6%; Score 379; DB 3; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA- 
I I I I II I I I I I I I I I I I I I I II I I! I I I I I 


30 


Db 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 


31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 



Db 


61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 


65 LCTRSQKEGLHYTC 7 8 



Db 


119 1 1 LLT I DRYLAWHAVFALKART VT FGWT S VI TWWAVFAS L PGI I FTRS QKEGLH YTC 178 


Qy 


Db 



Qy 


106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 


i i i i i i i i i i i i i i i i i i ■ i 

239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 



Db 


RESULT 3 

US-08-575-967A-2 

; Sequence 2, Application US/08575967A 

; Patent No. 6265184 

; GENERAL INFORMATION: 

; APPLICANT: Gray et al. 

; TITLE OF INVENTION: Chemokine Receptor Materials and Methods 
; NUMBER OF SEQUENCES: 16 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Marshall, O f Toole, Gerstein, Murray & Borun 

STREET: 6300 Sears Tower, 233 S. Wacker Drive 
CITY: Chicago 

; STATE: Illinois 

; COUNTRY: USA 

; ZIP: 60606 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: Patentln Release #1.0, Version #1.30 

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/575, 967A 

FILING DATE: 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: No. 6265184and, Greta E. 
; REGISTRATION NUMBER: 35,302 

REFERENCE/DOCKET NUMBER: 32918 
; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 206-485-1900 

TELEFAX: 206-485-1662 
; INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 


FEATURE: 


; NAME/ KEY: misc_f eature 

; OTHER INFORMATION: /= "88C amino acid sequence" 

US-08-575-967A-2 

Query Match 47.6%; Score 379; DB 3; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 3 0 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I 1111111111111:1: : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 ■ LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAWHAVFALKARTWFGVA^SVITWWAVFASLPGIIFTRSQKEGLHYTC 17 8 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I :: I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 4 
US-08-833-752-5 

Sequence 5, Application US/08833752 
Patent No. 6448375 
GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 
APPLICANT: PARMENTIER, MARC 
APPLICANT: VASSART, GILBERT 
APPLICANT: LIBERT, FREDERICK 

TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 
TITLE OF INVENTION: AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/833, 752 
FILING DATE: 9-APR-1997 
CLASSIFICATION: 536 


ATTORNEY/ AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/ DOCKET NUMBER: 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-752-5 

Query Match 47.6%; Score 37 9; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MD YQVS S P I YD I N Y YT SEP CQKI N VKQ I AA 30 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGI FF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I II I I I I I I II 

Db 119 IILLTIDRYIAVVmVFALKARTWFGWTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I II I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLV74VICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 5 

US-09-502-783A-2 

; Sequence 2, Application US/09502783A 
; Patent No. 6511826 
; GENERAL INFORMATION: 
; APPLICANT: Li, Yi 

APPLICANT: Ruben, Steven M. 
; TITLE OF INVENTION: Polynucleotides Encoding Human G-Protein Chemokine 
Receptor (CCR5) 

; TITLE OF INVENTION: HDGNR10 
; FILE REFERENCE: 1488.1150006 

; CURRENT APPLICATION NUMBER: US/ 0 9/502 , 7 83A 
; CURRENT FILING DATE: 2001-08-23 
; PRIOR APPLICATION NUMBER: 08/466,343 
; PRIOR FILING DATE: 1995-06-06 
; NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.0 
; SEQ ID NO 2 
; LENGTH: 352 
TYPE: PRT 


; ORGANISM: Homo sapiens 
US-09-502-783A-2 


Query Match 47.6%; Score 37 9; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I II I I I I I I I II I I I II I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYIAWHAVFALKARTWFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 7 9 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVTyrVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 6 
US-09-796-202-1 

; Sequence 1, Application US/09796202 

; Patent No. 6548636 

; GENERAL INFORMATION: 

; APPLICANT: Dragic, Tat j ana 

; APPLICANT: Olson, William 

; TITLE OF INVENTION: SULFATED CCR5 PEPTIDES FOR HIV-1 INFECTION 

; FILE REFERENCE: 2048/61010/ JPW/SHS 

; CURRENT APPLICATION NUMBER: US/ 0 9/7 9 6 , 2 02 

; CURRENT FILING DATE: 2001-02-28 

; NUMBER OF SEQ ID NOS : 17 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 1 

LENGTH: 352 

TYPE: PRT 

ORGANISM: human 
US-09-796-202-1 

Query Match 47.6%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-34; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I 1 I I I I I I I II I I I : I : : I 


Db 


61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 


Db 



Db 


Qy 



106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 


! I I I I I I I I I I I I I I II I • I 

239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 



Db 


RESULT 7 

US-09-045-583-52 

; Sequence 52, Application US/09045583 
; Patent No. 6287805 
; GENERAL INFORMATION: 

APPLICANT: Graham, Gerard J. et al . 

TITLE OF INVENTION: No. 6287805el Molecules of the G Protein-Coupled 
NUMBER OF SEQUENCES: 56 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: LAHIVE & COCKFIELD, LLP 

; STREET: 2 8 State Street 

CITY: Boston 

STATE: Massachusetts 

COUNTRY: USA 

ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
; CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/ 09/ 04 5 , 583 

; FILING DATE: 20-MAR-98 

; CLASSIFICATION: 435 

PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 

; FILING DATE: 

; ATTORNEY/AGENT INFORMATION: 
; NAME: Mandragouras , Amy E. 

REGISTRATION NUMBER: 36,2 07 
; REFERENCE/DOCKET NUMBER: MNI-044 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (617)227-74 00 

TELEFAX: (617)7 42-4214 
; INFORMATION FOR SEQ ID NO: 52: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 352 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FRAGMENT TYPE: internal 
US-09-045-583-52 


Query Match 46.9%; Score 374; DB 3; Length 352; 

Best Local Similarity 34,6%; Pred. No. 6e-34; 

Matches 98; Conservative 8; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I II I I I I I : I I I I II I I I I I I I I I I I 
Db 1 MDYQVS S P I YDI DYYTS EPCQKINVKQI AARLLPPLYS LVFI FGFVGNMLVI LI LINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 1 1 LLT I DRYLAI VHAVFALKARTVT FG WTSVITWVVAVFASLPGI I FTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I II I I I I : : | 

Db 17 9 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 8 

US-09-534-185-52 

; Sequence 52, Application US/09534185 
; Patent No. 6403767 

GENERAL INFORMATION: 
; APPLICANT: Graham, Gerard J. et al. 

; TITLE OF INVENTION: No. 6403767el Molecules of the G Protein-Coupled 

; Heptahelical Receptor Superfamily and Uses 

; Therefor 

NUMBER OF SEQUENCES: 56 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD, LLP 

STREET : 28 State Street 

CITY: Boston 
; STATE: Massachusetts 

COUNTRY: USA 
; ZIP: 02109 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/534 , 185 

FILING DATE: 24-Mar-2000 
; CLASSIFICATION: <Unknown> 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/045,583 

FILING DATE: <Unknown> 
; ATTORNEY/AGENT INFORMATION: 

; NAME: Mandragouras , Amy E. 


REGISTRATION NUMBER: 36,207 
REFERENCE/DOCKET NUMBER: MNI-044 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: ( 617 ) 227-7400 
TELEFAX: ( 617 ) 742-4214 
INFORMATION FOR SEQ ID NO: 52: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FRAGMENT TYPE: internal 
SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
US-09-534-185-52 

Query Match 46.9%; Score 374; DB 4; Length 352; 

Best Local Similarity 34.6%; Pred. No. 6e-34; 

Matches 98; Conservative 8; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPI YDINYYTSEPCQKINVKQIAA 30 

I I I I I II I I I I I : II I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I II I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAIVILAVFALKARTWFGVA/TSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I II I I I I I I I I I I I II : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I II I I I : I 

Db 239 TIMI VYFLFWAP YNI VLLLNT FQEFFGLNNCS S SNRLDQAMQV 281 


RESULT 9 

US-08-466-343D-2 

; Sequence 2, Application US/08466343D 

; Patent No. 6025154 

; GENERAL INFORMATION: 

; APPLICANT: LI, Yi 

; TITLE OF INVENTION: POLYNUCLEOTIDES ENCODING HUMAN G-PROTEIN 

TITLE OF INVENTION: CHEMOKINE RECEPTOR HDGNR10 (AS AMENDED) 
NUMBER OF SEQUENCES: 9 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: STERNE, KESSLER, GOLDSTEIN & FOX P.L.L.C. 

STREET: 1100 NEW YORK AVE., NW, SUITE 600 

CITY: WASHINGTON 
; STATE: DC 

; COUNTRY: USA 

; ZIP: 20005 

; COMPUTER READABLE FORM: 


MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/ 4 66, 34 3D 
FILING DATE: 06-JUN-1995 
; CLASSIFICATION: 435 

ATTORNEY/AGENT INFORMATION: 
; NAME: STEFFE, ERIC K. 

; REGISTRATION NUMBER: 36,688 

REFERENCE/ DOCKET NUMBER: 14 8 8 . 1150000/EKS/KLM 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (202) 371-2600 

TELEFAX: (202) 371-2540 
; INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 352 amino acids 
TYPE: amino acid 
; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-08-466-343D-2 

Query Match 46.8%; Score 373; DB 3; Length 352; 

Best Local Similarity 34.6%; Pred. No. 7.8e-34; 

Matches 98; Conservative 7; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCPKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCQR 60 

QY 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LESMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAIV1LAVFALKARTWFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I II I I I I I I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 10 
US-09-517-605-5 

; Sequence 5, Application US/09517605 

; Patent No. 6391567 

; GENERAL INFORMATION: 

; APPLICANT: Littman, Dan R. 

APPLICANT: Kwon, Douglas S. 
; APPLICANT: van Kooyk, Yvette 
; APPLICANT: Geijtenbeck, Tneo 


; TITLE OF INVENTION: METHODS OF USING A FACILITATOR OF RETROVIRAL ENTRY INTO 

TITLE OF INVENTION: CELLS 
; FILE REFERENCE: 1049-1-017 

; CURRENT APPLICATION NUMBER: US/09/517,605 

; CURRENT FILING DATE: 2000-03-02 

; NUMBER OF SEQ ID NOS : 17 

; SOFTWARE : PatentlnVer. 2.0 

; SEQ ID NO 5 

LENGTH: 352 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-517-605-5 

Query Match 45.0%; Score 359; DB 4; Length 352; 

Best Local Similarity 33.9%; Pred. No. 2.9e-32; 

Matches 96; Conservative 8; Mismatches 29; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYDTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVI LVLINCKR 60 

QY 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I II I I I I I : I : : | 

Db 61 LKSMTDI YLLNLAI SDLFFLLTVP FWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRY^IVHAVFALKARTVTFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 7 9 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I II I I I I I I I I I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I II I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 11 
US-08-724-984A-2 

; Sequence 2, Application US/08724984A 
; Patent No. 6388055 
; GENERAL INFORMATION: 

; APPLICANT: Derk Bergsma, Mary Brawner, and Usman Shabon 
; TITLE OF INVENTION: No. 6388055el Mouse Genomic Clone of the CC- 
TITLE OF INVENTION: CKR5 Receptor 
NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: SmithKline Beecham Corporation 

STREET: 709 Swedeland Road, P.O. Box 1539 
; CITY: King of Prussia 

STATE: PA 
COUNTRY: USA 
ZIP: 19406-0939 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: DISKETTE, 3.5 INCH, 1.44 Mb STORAGE 
COMPUTER: IBM 4 86 


OPERATING SYSTEM: WINDOWS FOR WORKGROUPS 

SOFTWARE: MICROSOFT WORD 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/724 , 984A 

FILING DATE: October 3, 1996 

CLASSIFICATION: 800 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
; NAME: William T. Han 

REGISTRATION NUMBER: 34,344 

REFERENCE/ DOCKET NUMBER: ATG50023 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610 270 5024 

TELEFAX: 610 270 5090 
; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 354 
; TYPE: Amino Acid 

; TOPOLOGY: Linear 

US-08-724-984A-2 

Query Match 34.1%; Score 272; DB 4; Length 354; 

Best Local Similarity 27.9%; Pred. No. 1.6e-22; 

Matches 80; Conservative 12; Mismatches 29; Indels 166; Gaps 8; 

Qy 1 MDYQVSSP — IYDINYYTSEPCQKINVKQIAA 30 

I I : I I I lllhl I I I I I I I I I I I I I 
Db 1 MDFQGSVPTYIYDIDYGMSAPCQKINVKQIAAQLLPPLYSLVFIFGFAGNMMVFLILISC 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG- 64 

I I I I : I I I I I I : I : I I I 

Db 61 KKLKSVTDIYLLNLAISDLLFLLTLPFWAHYAANEWIFGNIMC — KVFTGVYHIGYFGGI 118 

Qy 65 LCTRSQKEGLHY 7 6 

: I I I I I I I II 

Db 119 FFIILLTIDRYIAI VlIAVFALKVTTWFGVITSvVTWvVAVFASLPEIIFTRSQKEGFHY 178 

Qy 77 TCSSHFPYSQYQFWKNFQTLK IHQRVHGGGGS YKC 111 

I I I I I I : : I I II I : I I I I I : I : : | 

Db 179 TCSPHFPHTQYHFWKSFQTLKMVILSLILPLLVMIICYSGILHT LFRCRNEKK 231 

Qy 112 GLCQEFFGLNNCSSSNRLD 130 

I I I I I I I I I I I I I I I I 

Db 232 RHRAVRLIFAIMIVYFLFWTPYNIVLLLTTFQEFFGLNNCSSSNRLD 278 


RESULT 12 
US-08-833-752-4 

Sequence 4, Application US/08833752 
Patent No. 6448375 
GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 
APPLICANT: PARMENTIER, MARC 
APPLICANT: VASSART, GILBERT 
APPLICANT: LIBERT, FREDERICK 


TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 
TITLE OF INVENTION: AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 62 0 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE : CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/833, 752 
FILING DATE: 9-APR-1997 
CLASSIFICATION: 536 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/ DOCKET NUMBER: 
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 184 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-752-4 

Query Match 32.4%; Score 258; DB 4; Length 184; 

Best Local Similarity 34.9%; Pred. No. 2.7e-21; 

Matches 65; Conservative 4; Mismatches 13; Indels 104; Gaps 4 

Qy 1 MD YQVS S P I YD I N Y YT SEP CQ KI N VKQ I AA 30 

I I I M I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

QY 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I II I I I II I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I II I 

Db 119 IILLTIDRYLAWHAVFALKARTWFGWTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPY 84 

I I I I I I 

Db 179 SSHFPY 184 


RESULT 13 
US-09-087-232A-17 

; Sequence 17, Application US/09087232A 
; Patent No. 6153431 
; GENERAL INFORMATION: 


APPLICANT: Quillent et al . 

TITLE OF INVENTION : HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR 
TITLE OF INVENTION: VARIANTS ASSOCIATED WITH RESISTANCE TO VIRUS 
INFECTION. 

NUMBER OF SEQUENCES: 23 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Baker & Botts, L.L.P. attn. Lisa Kole 
STREET: 30 Rockefeller Plaza 
CITY: New York 
STATE: New York 
; COUNTRY: USA 

ZIP: 10112 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/087 , 232A 

FILING DATE: 28 MAY 1998 

CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/048,057 
; FILING DATE: 30 MAY 1997 

ATTORNEY/AGENT INFORMATION: 

NAME: KOLE, LISA B. 

REGISTRATION NUMBER: 35,225 

REFERENCE/ DOCKET NUMBER: AP 31115 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (212) 408-2628 
; TELEFAX: (212) 765-2519 

; INFORMATION FOR SEQ ID NO: 17: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 215 amino acids 

TYPE: amino acid 
; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-09-087-232A-17 


Query Match 32.4%; Score 258; DB 3; Length 215; 

Best Local Similarity 34.9%; Pred. No. 3.3e-21; 

Matches 65; Conservative 4; Mismatches 13; Indels 104; Gaps 4 

QY 1 MDYQVSSPI YDINYYTSEPCQKINVKQIAA 30 

I N I I I I I I I I I I I I I M I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I II I I : I : : | 

Db ^l LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I II 

Db 119 IILLTIDRYLAVVHAVFALKA.RTWFGWTS 178 


Qy 


79 SSHFPY 84 
I I I I I I 


Db 


179 SSHFPY 184 


RESULT 14 
US-08-833-752-6 

Sequence 6, Application US/08833752 
Patent No. 6448375 
GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 
APPLICANT : PARMENTIER, MARC 
APPLICANT: VASSART, GILBERT 
APPLICANT: LIBERT, FREDERICK 

TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 
TITLE OF INVENTION: AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/833, 752 
FILING DATE: 9-APR-1997 
CLASSIFICATION: 536 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/ DOCKET NUMBER: 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 215 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-752-6 

Query Match 32.4%; Score 258; DB 4; Length 215; 

Best Local Similarity 34.9%; Pred. No. 3.3e-21; 

Matches 65; Conservative 4; Mismatches 13; Indels 104; Gaps 4 

QY 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 3 0 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I II I I I I I I I : I : : | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I II I I 


Db 119 IILLTIDRYIAVVHAVFALKARTVTFGWTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 


Qy 79 SSHFPY 84 

I I I I I I 

Db 179 SSHFPY 184 


RESULT 15 
US-09-087-232A-15 

; Sequence 15, Application US/09087232A 
; Patent No. 6153431 

GENERAL INFORMATION: 

APPLICANT: Quillent et al . 

TITLE OF INVENTION: HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR 
TITLE OF INVENTION: VARIANTS ASSOCIATED WITH RESISTANCE TO VIRUS 
INFECTION. 

; NUMBER OF SEQUENCES: 23 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Baker & Botts, L.L.P. attn. Lisa Kole 

STREET: 30 Rockefeller Plaza 

CITY: New York 

STATE: New York 

COUNTRY: USA 

ZIP: 10112 
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 08 7 , 2 32A 

FILING DATE: 28 MAY 1998 
; CLASSIFICATION: 435 

; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/048,057 
; FILING DATE: 30 MAY 1997 

ATTORNEY/AGENT INFORMATION: 

NAME: KOLE, LISA B. 

REGISTRATION NUMBER: 35,225 
; REFERENCE/ DOCKET NUMBER: AP 31115 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (212) 408-2628 

TELEFAX: (212) 765-2519 
; INFORMATION FOR SEQ ID NO: 15: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 100 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-09-087-232A-15 

Query Match 22.9%; Score 182.5; DB 3; Length 100; 

Best Local Similarity 43.0%; Pred. No. 3.8e-13; 

Matches 43; Conservative 0; Mismatches 4; Indels 53; Gaps 2; 


Qy 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA- 
I I I I I I I II I II I I I I I I I I I I I I I I I I I I 


30 


Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWD FGNTM 47 

I I I I I I I I I I I I I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTM 100 

Search completed: March 5, 2004, 16:30:36 
Job time : 38.0556 sees 


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 


Run on: 


Title: 


March 5, 2004, 16:16:19 ; Search time 31.5185 Seconds 

(without alignments) 
421.163 Million cell updates/sec 

US-10-057-890A-10 


Perfect score: 797 

Sequence: 1 MDYQVSSPIYDINYYTSEPC GLNNCS S SNRLDGHQRVHAA 138 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


283366 


Database 


PIR_78:* 
pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 


RESULT 1 
A43113 

chemokine (C-C) receptor 5 - human 
N; Alternate names: C-C CKR-5; CCR5 
C; Species: Homo sapiens (man) 

C;Date: 12-Jul-1996 #sequence_revision 12-Jul-1996 #text_change 20-Jun-2000 
C;Accession: A43113; S71808; A58834; A58832; G02653; A58833 
R;Samson, M. ; Labbe, O.; Mollereau, C. ; Vassart, G. ; Parmentier , M. 
Biochemistry 35, 3362-3367, 1996 

A;Title: Molecular cloning and functional expression of a new human CC-chemokine 
receptor gene. 

A; Reference number: A43113; MUID : 9624 1590 ; PMID: 8639485 
A;Accession : A43113 
A;Molecule type: mRNA 
A; Residues: 1-352 <SAM1> 

A;Cross-references: GB:X91492; NID: gl262810; PIDN : CAA62796 . 1 ; PID:gl262811 
R;Samson, M. ; Libert, F. ; Doranz, B.J.; Rucker, J.; Liesnard, C; Farber, CM.; 
Saragosti, S.; Lapoumeroulie, C; Cognaux, J.; Forceille, C; Muyldermans, G. ; 
Verhofstede, C; Burtonboy, G. ; Georges, M. ; Imai, T.; Rana, S.; Yi, Y . ; Smyth, 
R.J.; Collman, R.G.; Doms, R.W. ; Vassart, G. ; Parmentier, M. 


Nature 382, 722-725, 1996 

A;Title: Resistance to HIV-1 infection in Caucasian individuals bearing mutant 

alleles of the CCR-5 chemokine receptor gene. 

A; Reference number: S71808; MUID : 96345670; PMID: 8751444 

A; Accession: S71808 

A; Status: nucleic acid sequence not shown; not compared with conceptual 

translation 

A; Molecule type: DNA 

A; Residues: 182-206; 207-230 <SAM2> 

A; Accession: A58 834 

A; Status: nucleic acid sequence not shown; not compared with conceptual 

translation 

A;Molecule type: DNA 

A; Residues: 1-184, ' IKDSHLGAGPAAACHGHLLLGNPKNSASVSK ' <SAM3> 

A; Cross-references: GB:X99393; NID: gl524062 ; PIDN : CAA67767 . 1 ; PID:gl524063 
A;Note: this frameshift mutation results in a non-functional receptor but 
confers a degree of resistance to HIV-1 infection; it has an allele frequency of 
0.09 or more in some Caucasian populations and may have had a selective 
advantage by conferring resistance to Yersinia plague infections 
R;Combadiere, C; Ahuja, S.K.; Tiffany, H.L.; Murphy, P.M. 
J. Leukoc. Biol. 60, 147-152, 1996 

A; Title: Cloning and functional expression of CC CKR5, a human monocyte CC 

chemokine receptor selective for MlP-lalpha, MlP-lbeta, and RANTES. 

A; Reference number: A58832; MUID: 96295970; PMID: 8699119 

A; Accession: A58 832 

A;Molecule type: mRNA 

A; Residues: 1-352 <C0M1> 

A; Cross-references : GB:U57840; NID : gl5024 08 ; PIDN : AAB17071 . 1 ; PID:gl502409 

A; Experimental source: clone 8, endotoxin-stimulated peripheral blood monocytes 

R;Combadiere, C. 

submitted to the EMBL Data Library, May 1996 
A; Reference number: H01541 
A; Accession: G02653 

A; Status: translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-89, 1 L ', 91-352 <COM2> 
A; Cross-references : EMBL :U57 84 0 

R;Raport, C.J.; Gosling, J.; Schweickart, V.L.; Gray, P.W.; Charo, I.F. 
J. Biol. Chem. 271, 17161-17166, 1996 

A; Title: Molecular cloning and functional characterization of a novel human CC 

chemokine receptor (CCR5) for RANTES, MlP-lbeta, and MlP-lalpha. 

A; Reference number: A58833; MUID: 96291862 ; PMID: 8663314 

A;Accession: A58833 

A; Molecule type: mRNA 

A; Residues: 1-352 <RAP> 

A;Cross-references : GB:U54994; NID : g!457945 ; PIDN: AAC50598 . 1 ; PID:gl457946 
C;Comment: This is a receptor for chemokines MlP-lalpha (see PIR:A30574), MIP- 
lbeta (see PIR:A31767), and RANTES (see PIR:A28815) . 

C; Comment: Macrophage- and dual-tropic strains of HIV-1 bind to a complex of 
chemokine (C-C) receptor 5 and T-cell surface glycoprotein CD4 (see PIR:RWHUT4). 
C; Genetics : 

A; Gene: GDB : CMKBR5 ; CCR5; CKR-5; CC-CKR-5; CKR5; ChemR13 
A/Cross-references: GDB: 1230510; OMIM: 601373 
A; Map position: 3p21-3p21 
C; Function : 

A;Description: G protein-coupled receptor for chemokines MlP-lalpha, MlP-lbeta 
and RANTES 


A;Note: probably acts to control granulocyte proliferation and differentiation 
C; Super family : vertebrate rhodopsin 

C; Keywords: AIDS; G protein-coupled receptor; glycoprotein; phosphoprotein; 
transmembrane protein 

F;32-56/Domain: transmembrane #status predicted <TMl> 
F; 67- 87 /Domain : transmembrane #status predicted <TM2> 
F; 103-124/Domain: transmembrane #status predicted <TM3> 
F; 142-16 6/ Domain : transmembrane #status predicted <TM4> 
F; 193-2 18/Domain : transmembrane #status predicted <TM5> 
F;236-257/Domain: transmembrane #status predicted <TM6> 
F;285-300/Domain: transmembrane #status predicted <TM7> 
F;20-269, 101-178/Disulf ide bonds: #status predicted 

F;268/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F;336, 337, 342/Binding site: phosphate (Ser) (covalent) ffstatus predicted 
F;340, 343/Binding site: phosphate (Thr) (covalent) #status predicted 

Query Match 47.6%; Score 379; DB 2; Length 352; 

Best Local Similarity 35.0%; Pred. No. 2.5e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

MDYQVS S P I YD I N Y YT SEPCQKI NVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

MDYQVS S PI YDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVI LI LINCKR 60 

YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I II I I I I I : I : : I 

LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

IILLTIDRYLAWHAVFALKARTWFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 17 8 

SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 23 8 


Qy 

l 

Db 

l 

Qy 

31 

Db 

61 

Qy 

65 

Db 

119 

Qy 

79 

Db 

179 

Qy 

106 

Db 

239 


-GGSYKCGLC QEFFGLNNC S S SNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 


RESULT 2 
157505 

zinc finger protein - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Aug-1996 #sequence_revision 02-Aug-1996 #text_change 05-Nov-1999 
C; Access ion : 157505 

R;Ernoult-Lange, M. ; Kress, M. ; Hamer, D. 
Mol. Cell. Biol. 10, 418-421, 1990 

A;Title: A gene that encodes a protein consisting solely of zinc finger domain, 
is preferentially expressed in transformed mouse cells. 
A;Reference number: 157505; MUID : 90097859 ; PMID:2104662 
A;Accession: 157505 

A; Status: preliminary; translated from GB/EMBL/DDB J 
A;Molecule type: mRNA 
A; Residues: 1-201 <RES> 

A;Cross-references: GB:M32057; NID:gl99766; PIDN : AAA39729 . 1 ; PID:gl99767 


C; Genetics : 
A; Gene: M0K2 


Query Match 19.3%; Score 154; DB 2; Length 201; 

Best Local Similarity 37.0%; Pred. No. 9.6e-08; 


Matches 44; Conservative 13; Mismatches 44; Indels 18; Gaps 6 

Qy 26 KQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTR — SQKEGLHYTCSSHF- 82 

I I : I I : : : : : I t I I I I I I I I : I I II : I 

Db 60 KPYACEECGMSFSQR SNLHIHQRVH-TGERPYKCGECGKGFSQSSNLHIHRCTHTG 114 

Qy 83 — PYSQYQFWKNF QTLKIHQRVHGGGGS YKCGLCQEFFGLNNCS S SNRLDGHQRVH 136 


ll I • ii i*iiiiii iiii*i i i • • i i i i i i 

115 EKPYQCYECGKGFSQSSDLRIHLRVHTGEKPYHCGKCGQGF SQSSKLLIHQRVH 168 


RESULT 3 
137570 

zinc finger protein - human (fragment) 
C; Species: Homo sapiens (man) 

C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change Ol-Dec-2000 
C;Accession: 137570 

R;Abrink, M. ; Aveskogh, M. ; Hellman, L. 
DNA Cell Biol. 14, 125-136, 1995 

A; Title: Isolation of cDNA clones for 42 different Kruppel-related zinc finger 
proteins expressed in the human monoblast cell line U-937. 
A;Reference number: 137566; MUID: 95169271; PMID:7865130 
A;Accession: 137570 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-693 <RES> 

A;Cross-references: EMBL:X78927; NID:g498726; PIDN : CAA55527 . 1 ; PID:g498727 
C; Genetics : 
A; Gene: HZF4 

C; Superf amily : zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 18.1%; Score 144; DB 2; Length 693; 

Best Local Similarity 33.3%; Pred. No. 3e-06; 

Matches 39; Conservative 15; Mismatches 33; Indels 30; Gaps 7 

Qy 21 QKINVKQIAAYKCGLCAAA-QWDFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCS 7 9 

||::: I I I I I : I I I I I I I I I I I I I : : : 

Db 554 QKVHTGE-KPYKCGECGKGFKWSLNLDM--HQRVH-TGEKPYTCGAC GKHFSQA 603 

Qy 80 SHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSS SNRLDGHQRVH 136 

I : | : : | | | | | | | | : | : | I I : : I hill 

Db 604 S SLQLHQSVHTGEKPYKCDVCGKVF SRSSQLQYHRRVH 641 


RESULT 4 
T46469 

hypothetical protein DKFZp434G1930 . 1 - human (fragment) 
C; Species: Homo sapiens (man) 

C;Date: 04-Feb-2000 #sequence_revision 04-Feb-2000 #text_change 04-Feb-2000 
C; Access ion: T4 6469 

R;Blum, H. ; Bauersachs, S.; Mewes, H.W. ; Gassenhuber, J.; Wiemann, S. 
submitted to the Protein Sequence Database, January 2000 


A; Reference number: Z23034 

A; Access ion: T4 6469 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-319 <AAA> 

A; Cross-references : EMBL: AL1374 83 

A; Experimental source: adult testis; clone DKFZp434G1930 
C;Genetics : 

A;Note: DKFZp434G1930 . 1 

Query Match 18.0%; Score 143.5; DB 2; Length 319; 

Best Local Similarity 32.8%; Pred. No. 1.6e-06; 

Matches 38; Conservative 15; Mismatches 44; Indels 19; Gaps 7; 

Qy 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCSSHF P 8 3 

I I I I : ::: I I I : I INI:: I | | 

Db 9 YKCNECEKT-FSHRSSLLSHQRIH-TGEKPYKCNECEKAFSNS — STLIKHLRVHTGEKP 64 

Qy 84 YSQYQFWKNF QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 136 

I : II I I : I I I : I I I I I I I : : I I I : : I I II : I 
Db 65 YRCRECGKAFSQCSTLTVHQRIHTGEKLYKCGECEKAF NCRA — KLHRHQRIH 115 


RESULT 5 
153869 

zinc finger protein - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change 01-Dec-2000 

C;Accession: 153869 

R;Brady, J. P.; Piatigorsky, J. 

Gene 149, 299-304, 1994 

A;Title: A mouse cDNA encoding a protein with zinc- fingers and a KRAB domain 
shows similarity to human prof ilaggrin . 

A; Reference number: 153869; MUID : 950474 92 ; PMID: 7959006 
A;Accession: 153869 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-555 <RES> 

A;Cross-references: GB:L28167; NID:g758660; PIDN : AAA67545 . 1 ; PID:g758661 

C; Superf amily : zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 17.7%; Score 141; DB 2; Length 555; 

Best Local Similarity 23.8%; Pred. No. 4.7e-06; 

Matches 50; Conservative 25; Mismatches 47; Indels 88; Gaps 11; 

Qy 4 QVSSPIYDINYYTSEPCQ KINVKQIAAYKCGLCAAAQWDFGNTMCQH 5 0 

: I : I : I I I I : I : : : : I I I : I : : : I 

Db 303 RVCTP VKPYTCEQCEKSLLVDQHLMSHVKVHTRE-RPYNCETCGSA-FSQASHLQDH 357 

Qy 51 QRVH GHHHHS YKCGLCTRSQKEGLHYTCSSHF- 82 

I I : I II I I I I I : I : I I I : 

Db 358 QRLHTGEKPFKCDACGKSFSRSSHLRSHQRVHTGEKPYKCGECGKS FICSSNLY 411 

Qy 83 PYSQYQFWKNF QTLKIHQRVHGGGGSYKCGLCQEFFGLNN 122 

II II : | : | | : | | II I : I : : I I : 

Db 412 IHQRVHTGEKPYKCVDCGKEFSRPSSLQAHQGIHTGEKSYVCTMCGKGYTLNSNLQVHLR 471 


Qy 123 C SSSNRLDGHQRVH 136 

I I I : : I I I I I I 

Db 472 VHTGEKPYSCDVCGKGFSRS SQLQSHQRVH 501 


RESULT 6 
F31201 

GLI-related finger protein HKR4 - human (fragments) 
C; Species: Homo sapiens (man) 

C;Date: 31-Mar-1990 #sequence_revision 31-Mar-1990 #text_change 17-Mar-1994 
C;Accession: F31201 

R;Ruppert, J.M. ; Kinzler, K.W.; Wong, A- J.; Bigner, S.H.; Kao, F.T.; Law, M.L.; 

Seuanez, H.N. ; O'Brien, S.J.; Vogelstein, B. 

Mol. Cell. Biol. 8, 3104-3113, 1988 

A; Title: The GLI-Kruppel family of human genes. 

A; Reference number: A93103; MUID : 89096896 ; PMID:2850480 

A; Accession: F312 01 

A;Molecule type: DNA 

A; Residues: 1-223 <RUP> 

C; Keywords: DNA binding; zinc finger 

Query Match 17.4%; Score 138.5; DB 2; Length 223; 

Best Local Similarity 30.7%; Pred. No. 3.4e-06; 

Matches 42; Conservative 13; Mismatches 39; Indels 43; Gaps 7; 

Qy 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCSSHFPYSQYQFW 90 

I : I I I I : : I I I : I : I I I I I I I : II : I I : I 

Db 96 YECGQCGRA-FSHSSHFTQHLRIH-NGEKPYKCGECVRHQR — LH TGEKPYACSQCG 148 

Qy 91 KNF QTLKIHQRVHGGGGSYK CGLCQEFFG 119 

II I I I I : I I I : I I I : I I 

Db 149 KAFIWSSVLIEHQRIHTGEKPYECSDCGKAFRGRSHFFRHLRTHTGEKPFACGACGKAFG 208 

Qy 12 0 LNNCS S SNRLDGHQRVH 136 

I : : I I I I I I 
Db 209 QSSQLIQHQRVH 220 


RESULT 7 
138599 

zinc finger protein ZNF134 - human 
C; Species: Homo sapiens (man) 

C;Date: 23-Feb-1996 #sequence_revision 23-Feb-1996 #text_change 05-Nov-1999 
C; Access ion: 13 8599 
R;Tommerup, N.; Vissing, H. 
Genomics 27, 259-264, 1995 

A;Title: Isolation and fine mapping of 16 novel human zinc finger-encoding cDNAs 

identify putative candidate genes for developmental and malignant disorders. 

A;Reference number: A57785; MUID: 96044430 ; PMID:7557990 

A;Accession: 138599 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-348 <RES> 

A;Cross-references : EMBL:U09412; NID:g488552; PIDN : AAC50253 . 1 ; PID:g488553 

C; Genetics : 

A; Gene: GDB:ZNF134 

A;Cross-references : GDB: 137033 


A; Map position: 19ql3 . 4-19ql3 . 4 


Query Match 17.1%; Score 136.5; DB 2; Length 348; 

Best Local Similarity 27.3%; Pred. No. 8.2e-06; 

Matches 45; Conservative 13; Mismatches 46; Indels 61; Gaps 9; 

Qy 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLC TRSQKEGLH YT 77 

Mill: : I : I I I I : I hi I I I : : I I 

Db 97 YKCSECGKA-FSRKDTLVQHQRIHS-GEKPYECSECGKAFSRKATLVQHQRIHTGERPYE 154 

Qy 78 CS SHFPYSQYQFWKNF QTLKIHQRVHGGGGSYKCGLC 114 

M II : I I I : I II I I I I I I I 

Db 155 CSECGKTFSRKDNLTQHKRIHTGEMPYKCNECGKYFSHHSNLIVHQRVHNGARPYKCSDC 214 

Qy 115 QEFF GLN — NCSSSNRLDG HQRVH 136 

: I I I : I I : I I I I : I 

Db 215 GKVFRHKSTLVQHESIHTGENPYDCSDCGKSFGHKYTLIKHQRIH 259 


RESULT 8 
139311 

Kruppel-type zinc finger protein ZNF74 - human 
C; Species: Homo sapiens (man) 

C;Date: 06-Sep-1996 #sequence_revision 06-Sep-1996 #text_change Ol-Dec-2000 
C;Accession: 139311; F42825; S37482 

R;Aubry, M. ; Demczuk, S.; Desmaze, C; Aikem, M. ; Aurias, A.; Julien, J. P.; 
Rouleau,, G.A. 

Hum. Mol. Genet. 2, 1583-1587, 1993 

A; Title: Isolation of a zinc finger gene consistently deleted in DiGeorge 
syndrome . 

A; Reference number: 139311; MUID : 94093543 ; PMID: 8268910 
A;Accession: 139311 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-572 <RES> 

A; Cross-references: EMBL:X71623; NID : gl050529 ; PIDN : CAA50632 . 1 ; PID:gl050530 
A;Note: submitted to the EMBL Data Library, April 1993, revised 01-NOV-1995 
R;Aubry, M. ; Marineau, C; Zhang, F.R.; Zahed, L.; Figlewicz, D. ; Delattre, O.; 
Thomas, G. ; de Jong, P.J.; Julien, J. P.; Rouleau, G.A. 
Genomics 13, 641-648, 1992 

A; Title: Cloning of six new genes with zinc finger motifs mapping to short and 
long arms of human acrocentric chromosome 22 (p and qll.2). 
A; Reference number: A42825; MUID : 92347859 ; PMID: 1639391 
A;Accession : F42 825 

A; Status: preliminary; not compared with conceptual translation 
A;Molecule type: nucleic acid 
A;Residues: 427-444, 'V , 446-511 <AUB> 

A;Note: sequence extracted from NCBI backbone (NCBIP : 109775 ) 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 17.0%; Score 135.5; DB 2; Length 572; 

Best Local Similarity 29.2%; Pred. No. 1.7e-05; 

Matches 33; Conservative 14; Mismatches 25; Indels 41; Gaps 5; 

Qy 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHH SYKCGLCTRSQKEGLHYTCSSHFP 83 

hllll : I I I II I : I I I I : I I I 
Db 288 YRCGECGKA FNQRTHLTRHHRIHTGEKPYQCGSC GRAFT CHS 329 


Qy 


84 YSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 136 


Db 


. i • i ... i i • i i i • • i • i ♦ I I ill I 

330 SLTVHEKIHSGDKPFKCSDCEKAF NSRSRLTLHQRTH 366 


RESULT 9 
148668 

zinc finger protein 51 - mouse 

N; Alternate names: finger protein zfecl2 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change 08-Dec-2000 

C;Accession: 148668; A40984 

R; Burke, P.S.; Don, J.; Wolgemuth, D.J. 

Mamm. Genome 5, 387-389, 1994 

A;Title: Zfp-51, a murine zinc finger encoding gene mapping to the t-complex 
region of chromosome 17, encodes 19 contiguous zinc fingers and is ubiquitously 
expressed. 

A; Reference number: 148668; MUID : 94319090 ; PMID: 8043957 
A; Access ion: 14 8668 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-710 <RES> 

A;Cross-references : EMBL:X74855; NID:g488832; PIDN : CAA52 8 47.1; PID:g488833 
R;Crossley, P.H.; Little, P.F.R. 

Proc. Natl. Acad. Sci. U.S.A. 88, 7923-7927, 1991 

A;Title: A cluster of related zinc finger protein genes is deleted in the mouse 
embryonic lethal mutation t(wl8). 

A;Reference number: A40984; MUID: 91376058 ; PMID: 1680234 

A; Accession: A4 0984 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 672-710 <CRO> 

A/Cross-references : GB:M74235 

C; Genetics : 

A; Gene: Zfp-51 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 
C; Keywords: DNA binding; zinc finger 

Query Match 16.8%; Score 134; DB 2; Length 710; 

Best Local Similarity 28.6%; Pred. No. 2.9e-05; 

Matches 38; Conservative 20; Mismatches 39; Indels 36; Gaps 7; 

QY 9 IYDINYYTS EPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKC 63 

: I I : I I : I I I I I I I :: :: : I hi III 

Db 120 VYD-DYFSSAYSLMQPTAYIKEKPHQCGKCGKC FNTSSSLTVHHRIH-TKEKPYKC 173 

QY 64 GLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNC 123 


Db 


174 SVCDKS 




208 


Qy 


124 SSSNRLDGHQRVH 136 


Db 


2 09 VQLSALKSHQKLH 221 


RESULT 10 


S03679 

finger protein (clone mkr5) - mouse (fragment) 
C; Species: Mus musculus (house mouse) 

C;Date: 04-Dec-1992 #sequence_revision 04-Dec-1992 #text_change Ol-Dec-2000 
C;Accession: S03679 

R;Chowdhury, K. ; Rohdewohld, H.; Gruss, P. 
Nucleic Acids Res. 16, 9995-10011, 1988 

A;Title: Specific and ubiquitous expression of different Zn finger protein genes 
in the mouse. 

A; Reference number: S03677; MUID: 89057528 ; PMID: 3143103 
A; Accession: S 03 67 9 

A; Status: not compared with conceptual translation 
A;Molecule type: mRNA 
A; Residues: 1-475 <CHO> 

A;Note: the sequence in fig. 2 is inconsistent with that shown in fig.l in having 
64-Asn, 159-Trp, 187-trp, 188-trp, and in lacking a Met after 64-Gly and an Asp 
after 78-Ser 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 16.8%; Score 133.5; DB 2; Length 475; 

Best Local Similarity 27.8%; Pred. No. 2.2e-05; 

Matches 40; Conservative 24; Mismatches 49; Indels 31; Gaps 8; 

15 YTSEPCQK INVKQ IAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYK 62 

I I I I I : I I | : | : | | : :: I | | | | | : | 

Db 223 YTCEVCHKSFRYGSSLTVHQRIHTGEKPYECEICRKA-FSHHASLTQHQRVHS-GEKPFK 280 

Qy 63 CGLCTRSQKEGLHYTCSSHF PYSQYQFWKNF QTLKIHQRVHGGGGSYKCG 112 

I I :: :: :| :||: |: : |:| I I I I : I I ::| 

D *> 281 CKECGKAFRQNIH— LASHWRIHTGEKPFECGECGKSFSISSQLATHQRIHTGEKPFECK 338 

Qy 113 LCQEFFGLNNCSSSNRLDGHQRVH 13 6 

: I : : I I I I I : I 

Db 339 VCRKAFRQN IHLASHWRIH 357 


RESULT 11 
148689 

gene NK10 protein - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text_change Ol-Dec-2000 
C;Accession: 148689; S49078 

R;Lange, R. ; Christoph, A.; Thiesen, H.J.; Vopper, G. ; Johnson, K.R.; Lemaire, 
L.; Plomann, M. ; Cremer, H . ; Barthels, D. ; Heinleim, U.A.O. 
DNA Cell Biol. 14, 971-981, 1995 

A;Title: Developmentally regulated mouse gene NK10 eucodes a xZInc Finger 
Repressor Protein with differential NDA-Binding Domains. 
A; Reference number: 148689; MUID : 96069544 ; PMID:7576184 
A;Accession: 14 8 689 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-636 <RES> 

A;Cross-references: EMBL:X79828; NID:g506501; PIDN : CAA56225 . 1 ; PID:g506502 
C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 


Query Match 16.8%; Score 133.5; DB 2; Length 636; 

Best Local Similarity 31.6%; Pred. No. 2.9e-05; 


Matches 37; Conservative 15; Mismatches 44; Indels 21; Gaps 6 


QY 31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRS QKEGLHYTCSSHF 82 

1:1 II I : ::: lllhl hi II II | | | | 
Db 306 YQCSLCGKA- FQRSSSLVQHQRIH-TGEKPYRCNLCGRSFRHSTSLTQHEVTH SGEK 360 

QY 83 PYSQYQFWKNF QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 136 

1= : II : I I : I I I : : I : I | | | | | |:| 

Db 361 PFQCKECGKAFSRCSSLVQHERTHTGEKPFECSICGRAFG QSPSLYKHMRIH 412 


RESULT 12 
T14757 

hypothetical protein DKFZp572C163 . 1 - human (fragment) 
C; Species: Homo sapiens (man) 

C;Date: 20-Sep-1999 #sequence_revision 20-Sep-1999 #text_change 01-Dec-2000 
C; Accession: T14757 

R;Wambutt, R. ; Heubner, D.; Mewes, H.W.; Gassenhuber, J.; Wiemann, S. 

submitted to the Protein Sequence Database, August 1999 

A;Reference number: Z18181 

A; Accession: T14757 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-701 <WAM> 

A; Cross-references : EMBL : ALII 02 17 

A; Experimental source: adult subthalamic nucleus; clone DKFZp572C163 
C; Genetics : 

A;Note: DKFZp572C163 . 1 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 16.8%; Score 133.5; DB 2; Length 701; 

Best Local Similarity 23.4%; Pred. No. 3.2e-05; 

Matches 41; Conservative 16; Mismatches 37; Indels 81; Gaps 8; 

Qy 31 YKCGLCAAAQWDFGNT MCQHQRVH GHHH-HS- 60 

I'll I I : I I I I : I Ml: 

Db 4 68 YECNQC GKTFSQRTHLCAHQRIHTGEKPYECNECGKTF7VDNSALRAHHRIHTG 520 

Qy 61 YKCGLCTRSQKEGLHY TCSSHFPYSQYQFWKNF 93 

I : I I : : : I II II : | | 

Db 521 EKPYECNDCGKTFSKTSHLR7VHLRTRSGEKPYECSECGKTFSEKSYVSAHQRVHTGEKPY 580 

QY 94 QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 136 

ll::|||:| I l|:| I : I | : | I, :| 

Db 581 ECNVCGKPFAHNSTLRVHQRIHTGEKSYECNDCGKTF SQKSHLSAHQRIH 63 0 


RESULT 13 
A34612 

zinc finger protein ZNF7 - human 
N;Alternate names: zinc finger protein kox4 
C; Species: Homo sapiens (man) 

C;Date: 22-Jun-1990 #sequence_revision 22-Jun-1990 #text__change 01-Dec-2000 
C;Accession: A34612; A56409; S10421; 137972 

R;Lania, L. ; Donti, E. ; Pannuti, A.; Pascucci, A.; Pengue, G. ; Feliciello, I.; 
La Mantia, G. ; Lanfrancone, L. ; Pelicci, P.G. 
Genomics 6, 333-340, 1990 


A; Title: cDNA isolation, expression analysis, and chromosomal localization of 
two human zinc finger genes. 

A; Reference number: A34612; MUID: 90169993; PMID:2106481 
A; Access ion : A3 4 612 
A; Molecule type: mRNA 
A; Residues: 1-686 <LAN> 

A/Cross-references: GB:M29580; NID:g340445; PIDN : AAA61313 . 1 ; PID:g340446; 
GB:J04751 

R;Bray, P.; Lichter, P.; Thiesen, H.J.; Ward, D.C.; Dawid, I.B. 
Proc. Natl. Acad. Sci. U.S.A. 88, 9563-9567, 1991 

A; Title: Characterization and mapping of human genes encoding zinc finger 
proteins . 

A; Reference number: A56409; MUID : 92052132 ; PMID: 1946370 

A;Accession: A56409 

A;Molecule type: DNA 

A; Residues: 425-589 <BRA> 

A; Cross-references : GB:M7717 0 

R;Thiesen, H.J. 

submitted to the EMBL Data Library, March 1990 

A; Reference number: S10397 

A; Accession: Si 0421 

A;Molecule type: mRNA 

A; Residues: 413-468 <THI> 

A;Cross-references : EMBL : X52335 ; NID:g34165; PIDN : CAA.36561 . 1 ; PID:g930095 

R; Thiesen, H.J. 

New Biol. 2, 363-374, 1990 

A; Title: Multiple genes encoding zinc finger domains are expressed in human T 
cells . 

A; Reference number: 137949; MUID : 91145339 ; PMID:2288909 
A; Access ion: 137972 

A; Status: translated from GB/EMBL/DDBJ 

A;Molecule type: mRNA 

A; Residues: 413-468 <RES> 

A;Cross-references : EMBL:X52335; NID:g34165; PIDN : CAA.36561 . 1 ; PID:g930095 

C; Genetics : 

A; Gene: GDB : ZNF7 

A;Cross-references : GDB: 120509; OMIM: 194531 
A; Map position: 8q24 . 3-8q24 . 3 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 
C;Keywords: DNA binding; zinc finger 

Query Match 16.5%; Score 131.5; DB 2; Length 686; 

Best Local Similarity 26.9%; Pred. No. 4.8e-05; 

Matches 45; Conservative 11; Mismatches 46; Indels 65; Gaps 8; 

Qy 31 YKCGLCAAAQWDFG — NTMCQHQRVHGHHHHSYKCGLCTRSQKEGLH 7 5 

III I I II : : : I I I I : I I I : : I 

Db 441 YKCNKCTKA FGCSSRLIRHQRTH-TGEKPFKCDECGKGFVQGSHLIQHQRIHTGEKP 496 

Qy 7 6 YTCS SHFPYSQYQFWKNFQ TLKIHQRVHGGGGSYKCG 112 

11= I I III I I I I II I I III 

Db 497 YVCNDCGKAFSQSSSLIYHQRIHKGEKPYECLQCGKAFSMSTQLTIHQRVHTGERPYKCN 556 

Qy 113 LCQEFFGLNN CSS SNRLDGHQRVH 136 

I : I I : II I : I I I I : I 

Db 557 ECGKAFSQNSTLFQHQIIHAGVKPYECSECGKAFSRSSYLIEHQRIH 603 


RESULT 14 
S70006 

finger protein zfOCl - guinea pig 

C; Species: Cavia porcellus (guinea pig) 

C;Date: 06-Dec-1996 #sequence_revision 13-Mar-1997 #text_change 05-Nov-1999 
C; Accession: S70006 

R;Rivolta, M.N.; Negrini, C. ; Wilcox, E.R. 
Biochim. Biophys . Acta 1306, 127-132, 1996 

A;Title: A novel zinc finger gene preferentially expressed in the retina and the 

organ of Corti localizes to human chromosome 12q24.3. 

A;Reference number: S70006; MUID: 96221281 ; PMID:8634327 

A; Access ion: S7 0006 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-261 <RIV> 

A;Cross-references: EMBL:L26335; NID : gl237277 ; PIDN: AAC42 091 . 1 ; PID:gl237278 
C; Keywords: zinc 

Query Match 16.4%; Score 131; DB 2; Length 261; 

Best Local Similarity 28.8%; Pred. No. 2.1e-05; 

Matches 42; Conservative 16; Mismatches 44; Indels 44; Gaps 7; 

Qy 2 6 KQIAAYKCG LCAAAQWDFGNT — MCQHQRVHGHHHHSY 61 

I I : I I : I : I I : : I I I I I I : 

Db 85 KPYKCYECGKAFNWSPHLQIHMRVHTGEKPYVCSECGRGFSNSSNLCMHQRVH-TGEKPF 143 

Qy 62 KCGLCTRS QKEGLHYTCSSHFPYSQYQFWKNF QTLKIHQRVHGGGGSYK 110 

Ml- : : | : I I I : I I : I I I I I I I I : 

Db 144 KCEECGKAFRHTSSLCMHQRVH TGEKPYKCYECGKAFSQSSSLCIHQRVHTGEKPYR 200 

Qy 111 CGLCQEFFGLNNCSSSNRLDGHQRVH 136 

I I : I I I : I I I I I I 

Db 201 CCGCGKAF SQSSSLCIHQRVH 221 


RESULT 15 
JH0500 

zinc finger protein 29 - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 31-Mar-1992 #sequence_revision 31-Mar-1992 #text_change 08-Dec-2000 

C;Accession: JH0500 

R; Denny, P.; Ashworth, A. 

Gene 106, 221-227, 1991 

A;Title: A zinc finger protein-encoding gene expressed in the post-meiotic phase 
of spermatogenesis. 

A; Reference number: JH0500; MUID : 9203908 0 ; PMID: 1937051 
A; Accession: JH0500 
A;Molecule type: mRNA 
A; Residues: 1-614 <DEN> 

A;Cross-references : GB:X55126; NID:g55470; PIDN : CAA38 920 . 1 ; PID:g55471 

A; Experimental source: testis 

C; Keywords: DNA binding; zinc finger 

F;217-244/Region: zinc finger 

F;245-272/Region: zinc finger 

F;273-300/Region: zinc finger 

F;301-328/Region: zinc finger 


F;329- 
F;357- 
F;385- 
F;413- 
F;441- 
F;469- 
F;497- 
F;525- 
F;553- 
F;581- 


-356/Region: 
-384/Region: 
-412/Region: 
-440/Region: 
-468/Region: 
-496/Region: 
-524/Region: 
-552/Region: 
-580/Region: 
-608/Region: 


zinc 
zinc 
zinc 
zinc 
zinc 
zinc 
zinc 
zinc 
zinc 
zinc 


finger 
finger 
finger 
finger 
finger 
finger 
finger 
finger 
finger 
finger 


Query Match 16.2%; 
Best Local Similarity 30.3%; 
Matches 37; Conservative 


Score 129.5; DB 2; 
Pred. No. 6.8e-05; 
12; Mismatches 42; 


Length 614; 
Indels 31; 


Gaps 


6 


Qy 

Db 

Qy 

Db 


31 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRS QKEGLH 75 

|:||| : : : I : I I I I I II I : I : I I 

418 YQCGEC-GKNFSRSSNLATHRRTH-LVEKPYKCGLCGKSFSQSSSLIAHQGTHTGEKPYE 475 

7 6 -YTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQR 134 
II I : I I I II I I | : | I I : I I : : I I I I 

476 CLTCGESFSWS SNLIKHQRTHTGEKPYRCGDCGKGF SQRSQLWHQR 522 


Qy 135 VH 136 

I 

Db 523 TH 524 


Search completed: March 5, 2004, 16:28:55 
Job time : 31.5185 sees 


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 
Run on : 


Title: 

Perfect score: 
Sequence : 

Scoring table: 


March 5, 2004, 16:22:54 ; Search time 74.537 Seconds 

(without alignments) 
390.935 Million cell updates/sec 

US-10-057-890A-10 
797 

1 MDYQVSSPIYDINYYTSEPC GLNNCSSSNRLDGHQRVHAA 138 

BLOSUM62 

Gapop 10.0 , Gapext 0 . 5 


Searched: 809742 seqs, 211153259 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


809742 


Database 


Published_Applications_AA: * 

1: /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB.pep: * 

2 : / cgn2_6/pt oda ta/ 1/pubpaa/ PCT_NEW_PUB . pep : * 

3 : /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB .pep : * 

4: /cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB.pep: * 

5: /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB.pep: * 

6 : / cgn2_6/pt odata/ 1 /pubpaa/ PCTUS_PUBCOMB . pep : * 

7 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB . pep : * 

8: /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 

9: /cgn2_6/ptodata/l/pubpaa/US09A_PUBCOMB.pep:* 

10: /cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep: * 

11: /cgn2_6/ptodata/l/pubpaa/US09C_PUBCOMB.pep: * 

12 : /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep:* 

13 : /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep: * 

14 : /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep: * 

15 : /cgn2_6/ptodata/l/pubpaa/US10C_PUBCOMB.pep: * 

16: /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep: * 

17: /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB.pep:* 

18: /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 


Result Query 

No. Score Match Length DB 


ID 
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Sequence 17, Appl 
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Sequence 5, Appli 
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9 
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Sequence 5, Appli 
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379 
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352 

9 

US-09-502-783A-2 

Secruence 2 AddII 

^ kJ v — X X k^ V— C» m il .kV k> _l L~ 

16 

379 

47 . 

6 

352 

10 

US- 0 9- 7 3 4-2 2 1A- 14 

Seauence 14. AddI 

k^ V— - k^ k* Vw X X w *w JL T ^ i*.M JL 

17 

379 

47 . 

6 

352 

11 

US- 09-82 6-509-477 

kV k-/ k/ ~S \J £-t W k/ *S Iff 

Seauence 477 Ann 

fc-J k_* k^ k* k^- x x k*» k-* "X f J # ilk/ 

18 

379 

47 . 

6 

352 
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379 

47 . 


352 

14 
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379 

47 . 
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352 

14 

US- 10- 067- 8 00-22 

Secruence 22, Annl 

22 

379 

47 . 

6 

352 

14 

US-1 0-2 90-0 5 8A- 6 

\y JL k/ £-» -y \J k/ \J -1 X V-/ 

Secruence 6. Annli 

k^ v_^- \a. k-. x x v-* k» k/ ^ iiy ky _i_ _i_ 
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352 
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14 
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14 
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6 

352 

14 
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Sequence 1, Appli 

27 

379 

47 . 

6 

352 

14 

US- 10- 135- 8 3 9-22 

Sequence 22, Appl 

28 

379 

47 . 


352 

14 

Us- 10-2 3 9- 42 3- 67 

Sequence 67 , Appl 

29 

379 

47 . 

6 

352 

14 

US-1 0-439-84 5- 4 

Sequence 4, Appli 

30 

379 

47 . 

6 

352 

15 

Us-10-360-828-1 

Seauence 1 . Aooli 

v w k^ k* X X k*» k- JU f A LK_/ k/ JL- JL. 

31 

374 

46 . 

9 

352 

14 

Us- 10- 164 -64 9- 52 

Sequence 52, Appl 

32 

374 

46. 

9 

352 

14 

US-10-439-845-2 

Sequence 2, Appli 

33 

373 

46 . 

8 

352 

9 

US-09-77 9-87 9A-2 

Seauence 2. AddIi 

k/ \J* V-^ kA VJ- X X *w k^» ^ ^ ilf / k/ JL. JL. 

34 

373 

46 . 

8 

352 

9 

US-09-779-8 8 0A-2 

Sequence 2, Appli 

35 

373 

46 . 

8 

352 

14 

US- 10- 067- 8 00-2 

V-/ / 1 U k/ W / VJ 1 \J k/ 

Seauence 2- Annli 

36 

373 

46. 

8 

352 

14 

Us-10-135-839-2 

Sequence 2, Appli 

37 

363 

45 . 

5 

352 

13 

Us- 10- 10 6- 62 3-20 

Sequence 20, Appl 

38 

258 

32 . 

4 

184 

9 

Us-09-938-719-4 

Sequence 4, Appli 

39 

258 

32 . 

4 

184 

9 

US-09-939-226-4 

Sequence 4, Appli 

40 

258 

32. 

4 

184 

9 

US-09-938-703-4 

Sequence 4, Appli 

41 

258 

32. 

4 

215 

9 

US-09-938-719-6 

Sequence 6, Appli 

42 

258 

32. 

4 

215 

9 

US-09-939-226-6 

Sequence 6, Appli 

43 

258 

32. 

4 

215 

9 

US-09-938-703-6 

Sequence 6, Appli 

44 

183.5 

23. 

0 

332 

14 

US-10-095-876A-2 

Sequence 2, Appli 

45 

182 

22. 

8 

32 

14 

US-10-057-890A-13 

Sequence 13, Appl 


ALIGNMENTS 


RESULT 1 

US-10-057-890A-10 

; Sequence 10, Application US/10057890A 

; Publication No. US20030044901A1 

; GENERAL INFORMATION: 

; APPLICANT: Coleman, Timothy 


; APPLICANT: Mansfield, Brian 

; TITLE OF INVENTION: Scaffold Fusion Polypeptides, Composition for Making the 
Same, and Methods 

TITLE OF INVENTION: of Using the Same. 
FILE REFERENCE: PF537 

CURRENT APPLICATION NUMBER: US/ 1 0/ 057 , 8 90A 
CURRENT FILING DATE: 2002-01-29 
PRIOR APPLICATION NUMBER: 60/265,782 
PRIOR FILING DATE: 2001-01-31 
PRIOR APPLICATION NUMBER: 60/265,858 
PRIOR FILING DATE: 2001-01-31 
NUMBER OF SEQ ID NOS: 32 
SEQ ID NO 10 
LENGTH: 13 8 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-057-890A-10 

Query Match 100.0%; Score 797; DB 14; Length 138; 

Best Local Similarity 100.0%; Pred. No. 3e-71; 

Matches 13 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60 

I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60 

Qy 61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120 

Qy 121 NNCSSSNRLDGHQRVHAA 13 8 

I I I I I I I I I I I I I I I I I I 

Db 121 NNCSSSNRLDGHQRVHAA 138 


RESULT 2 

US-10-057-890A-31 

; Sequence 31, Application US/10057890A 

; Publication No. US2 0030044 901A1 

; GENERAL INFORMATION: 

; APPLICANT: Coleman, Timothy 

; APPLICANT: Mansfield, Brian 

TITLE OF INVENTION: Scaffold Fusion Polypeptides, Composition for Making the 
Same, and Methods 

; TITLE OF INVENTION: of Using the Same. 

FILE REFERENCE: PF537 
; CURRENT APPLICATION NUMBER: US/10/057 , 890A 
; CURRENT FILING DATE: 2002-01-29 
; PRIOR APPLICATION NUMBER: 60/265,782 
; PRIOR FILING DATE: 2001-01-31 
; PRIOR APPLICATION NUMBER: 60/265,858 
; PRIOR FILING DATE: 2001-01-31 
; NUMBER OF SEQ ID NOS: 32 
; SEQ ID NO 31 

LENGTH: 157 

TYPE: PRT 
; ORGANISM: Homo sapiens 


US-10-057-890A-31 


Query Match 100.0%; Score 797; DB 14 

Best Local Similarity 100.0%; Pred. No. 3.4e-71 
Matches 138; Conservative 0; Mismatches 0 


Qy 


Db 


Qy 


Db 


Qy 


Db 


20 


Length 157; 

Indels 0; Gaps 0 

60 


MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 
I I I II I I I II I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I II II I I I II I I I I I I I 
MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 79 


61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120 

I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
8 0 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 139 

121 NNCSSSNRLDGHQRVHAA 138 

I I I I I I I I I I I I I I I I I I 
140 NNCSSSNRLDGHQRVHAA 157 


RESULT 3 
US-09-725-285-2 

; Sequence 2, Application US/09725285 

; Patent No. US20010000241A1 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven, M. 

; TITLE OF INVENTION: Antibodies to Human G-Protein Chemokine Receptor 
HDGNR10 

; TITLE OF INVENTION: (CCR5 Receptor) 

; FILE REFERENCE: 1488.1150003 

; CURRENT APPLICATION NUMBER: US/09/725, 285 

; CURRENT FILING DATE: 2000-11-29 

; PRIOR APPLICATION NUMBER: 09/339,912 

; PRIOR FILING DATE: 1999-06-25 

; PRIOR APPLICATION NUMBER: 09/195,662 

; PRIOR FILING DATE: 1998-11-18 

; PRIOR APPLICATION NUMBER: 08/466,343 

; PRIOR FILING DATE: 1995-06-06 

; NUMBER OF SEQ ID NOS : 9 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 2 

; LENGTH: 352 

; TYPE: PRT 

; ORGANISM: Artificial Sequence: Genomic 
FEATURE : 

OTHER INFORMATION: Deduced Amino Acid Sequence 
US-09-725-285-2 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35,0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPI YDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 


31 


YKCGLC 


AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 


I I I I I I I I I I . I . I 

LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 



Db 


61 


Qy 


65 


LCTRSQKEGLHYTC 78 


Db 


119 


1 1 LLT I D R Y LAWHAVFAL KART VT F GWT S VI TWWAVFAS LPGI I FTRSQKEGLH YTC 178 


Qy 


Db 


179 


79 



Qy 


106 


GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 



Db 


239 


TIMI VYFLFWAP YNI VLLLNTFQEFFGLNNCS S SNRLDQAMQV 281 


RESULT 4 
US-09-759-841-2 

; Sequence 2, Application US/09759841 

; Patent No. US20010039026A1 

; GENERAL INFORMATION: 

; APPLICANT: Rickett, Graham A 

; APPLICANT: Dobbs, Susan 

; APPLICANT: Perros, Manoussos 

; TITLE OF INVENTION: Assay Method 

; FILE REFERENCE: PC1034 8APME 

; CURRENT APPLICATION NUMBER: US/09/759, 841 

; CURRENT FILING DATE: 2001-01-12 

; PRIOR APPLICATION NUMBER: GB 0000661.9 

; PRIOR FILING DATE: 2000-01-12 

; PRIOR APPLICATION NUMBER: GB 0000663.5 

; PRIOR FILING DATE: 2000-01-12 

; PRIOR APPLICATION NUMBER: GB 0000659.3 

; PRIOR FILING DATE: 2000-01-12 

; NUMBER OF SEQ ID NOS : 6 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 2 

; LENGTH: 352 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-759-841-2 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVS S PI YDINYYT S EPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVS S PI YDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I I I I I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 


Db 


119 


1 1 LLTI DRYIAWHAVFALKART VTFGVVT SVI TWWAVFAS LPGI I FTRSQKEGLHYTC 17 8 


Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG — 105 

I I I II I I I I I I I I I I I I I I I : : I 

Db 17 9 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCS SSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 5 

US-09-779-879A-22 

; Sequence 22, Application US/09779879A 

; Patent No. US2002004 878 6A1 

; GENERAL INFORMATION: 

; APPLICANT: Rosen, Craig A. 

; APPLICANT: Roschke, Viktor 

; APPLICANT: Li, Yi 

APPLICANT: Ruben, Steven, M. 
; TITLE OF INVENTION: Human G-protein Chemokine Receptor (CCR5) HDGNR10 
; FILE REFERENCE: 1488.115000A 
; CURRENT APPLICATION NUMBER: US/ 09/77 9 , 8 7 9A 
; CURRENT FILING DATE: 2001-02-09 
; PRIOR APPLICATION NUMBER: US 60/181,258 
; PRIOR FILING DATE: 2000-02-09 
; PRIOR APPLICATION NUMBER: US 60/187,999 
; PRIOR FILING DATE: 2000-03-09 
; PRIOR APPLICATION NUMBER: US 60/234,336 
; PRIOR FILING DATE: 2000-09-22 
; NUMBER OF SEQ ID NOS : 58 
; SOFTWARE: Patentln version 3.0 
; SEQ ID NO 22 
LENGTH: 352 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-779-879A-22 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MD YQVS S P I YD I N Y YT S E P CQ KI N VKQ I AA 30 

I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I II 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I II I I I I I I I I I I : I : : I 

Db 61 LKSMTDI YLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAWHAVFAXKARTWFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I II I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 6 

US-09-779-880A-22 

Sequence 22, Application US/09779880A 
Patent No. US20020061834A1 
GENERAL INFORMATION: 
APPLICANT: Rosen, Craig A. 
APPLICANT: Roschke, Viktor 
APPLICANT: Li, Yi 
APPLICANT: Ruben, Steven, M. 

TITLE OF INVENTION: Human G-protein Chemokine Receptor (CCR5) HDGNR10 
FILE REFERENCE: 1488.115000C 

CURRENT APPLICATION NUMBER: US/ 09/779, 8 8 OA 
CURRENT FILING DATE: 2001-02-09 
PRIOR APPLICATION NUMBER: US 60/181,258 
PRIOR FILING DATE: 2000-02-09 
PRIOR APPLICATION NUMBER: US 60/187,999 
PRIOR FILING DATE: 2000-03-09 
PRIOR APPLICATION NUMBER: US 60/234,336 
PRIOR FILING DATE: 2000-09-22 
NUMBER OF SEQ ID NOS : 58 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 22 
LENGTH: 352 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-779-880A-22 

Query Match 47.6%; Score 37 9; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

MD YQ VS SPIYDINYYTSEP CQKINVKQ I AA 30 

I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I 

MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I | I I I I I I I I I I I I I : I : : I 


Qy 

1 

Db 

1 

Qy 

31 

Db 

61 

Qy 

65 

Db 

119 

Qy 

79 

Db 

179 

Qy 

106 

Db 

239 


-LCTRSQKEGLHYTC 7 8 
: I I I I I I I I I I I I 


'PYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I! I I I I I I II :: I 


I I I I I I I I I II I I I I I 


RESULT 7 

US-09-813-653-15 

; Sequence 15, Application US/09813653 

; Patent No. US20020064770A1 

; GENERAL INFORMATION: 

; APPLICANT : Nestor, John 

; APPLICANT: Wilson, Carol 

; APPLICANT: See, Raymond 

; APPLICANT: Tan Hehir, Christina 

; TITLE OF INVENTION: Binding Compounds and Methods For Identifying Binding 
Compounds 

; FILE REFERENCE: CNS-005 

; CURRENT APPLICATION NUMBER: US/09/813,653 

CURRENT FILING DATE: 2001-03-20 
; PRIOR APPLICATION NUMBER: US 60/190,946 
; PRIOR FILING DATE: 2000-03-21 
; PRIOR APPLICATION NUMBER: US 60/190,996 
; PRIOR FILING DATE: 2000-03-21 
; PRIOR APPLICATION NUMBER: US 60/191,299 
; PRIOR FILING DATE: 2000-03-21 
; NUMBER OF SEQ ID NOS : 44 
; SOFTWARE: Patentln version 3.0 
; SEQ ID NO 15 
; LENGTH: 352 

TYPE : PRT 
; ORGANISM: Homo sapiens 
US-09-813-653-15 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 
Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 



Db 


1 MD YQVS S P I YDINYYT S EPCQKINVKQIAARLLP PLYSLVFI FGFVGNMLVI LI LINCKR 


60 


Qy 


31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 

I I I I I I I I I I I t I! I : I : : I 

61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 


64 


Db 


118 


Qy 



78 


Db 


178 


Qy 


79 S SHFP YSQYQFWKNFQTLKI HQRVHGG- 


105 



Db 


17 9 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 


238 


Qy 


Db 



RESULT 8 

US-09-813-653-17 

; Sequence 17, Application US/09813653 
; Patent No. US2 002 0064770A1 


GENERAL INFORMATION: 
APPLICANT: Nestor, John 
APPLICANT: Wilson, Carol 
APPLICANT: See, Raymond 
APPLICANT: Tan Hehir, Christina 

TITLE OF INVENTION: Binding Compounds and Methods For Identifying Binding 
Compounds 

FILE REFERENCE: CNS-005 

CURRENT APPLICATION NUMBER: US/ 0 9/ 8 13 , 653 
CURRENT FILING DATE: 2 001-03-20 
PRIOR APPLICATION NUMBER: US 60/190,946 
PRIOR FILING DATE: 2000-03-21 
PRIOR APPLICATION NUMBER: US 60/190,996 
PRIOR FILING DATE: 2000-03-21 
PRIOR APPLICATION NUMBER: US 60/191,299 
PRIOR FILING DATE: 2000-03-21 
NUMBER OF SEQ ID NOS : 44 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 17 
LENGTH: 352 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-813-653-17 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPI YDINYYTSEPCQKINVKQIAA 3 0 

I I I I I I I I I II 1 I I I I I I I I I I I I I M 1 I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILIQINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWvVAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I II II I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSS SNRLDGHQRV 135 

I I I II I I I I I I I I II II I : I 

Db 239 T IMI VYFLFWAP YNI VLLLNT FQEFFGLNNCS S SNRLDQAMQV 281 


RESULT 9 
US-09-796-202-1 

; Sequence 1, Application US/09796202 
; Patent No. US20020068813A1 
; GENERAL INFORMATION: 
; APPLICANT: Dragic, Tat j ana 
APPLICANT: Olson, William 
; TITLE OF INVENTION: SULFATED CCR5 PEPTIDES FOR HIV-1 INFECTION 


FILE REFERENCE: 2 04 8/61010/ JPW/SHS 
CURRENT APPLICATION NUMBER: US/ 09/7 9 6, 2 02 
CURRENT FILING DATE: 2001-02-28 
NUMBER OF SEQ ID NOS : 17 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 1 
LENGTH: 352 
TYPE: PRT 
ORGANISM: human 
US-09-796-202-1 

Query Match 47.6%; Score 37 9; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

MDYQVS S P I YDINYYT SEPCQKINVKQI AA 30 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

MDYQVS S P I YD I N Y YT SEP CQKI NVKQ I AARL L P P L Y S LVF I FGFVGNMLVI L I L INC KR 60 

YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I II I I I I I I I I I I I : I : : I 

LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

1 1 L LT I DRY LAWHAVFAL KART VT FGWT S VI T WWAVFAS LPGIIFTRSQKEG LH YT C 17 8 

SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I s s I 

S SHFPYSQYQFWKNFQTLKI VILGLVLPLLVMVI CYSGILKTLLRCRNEKKRHRAVRLI F 238 

GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

T IMI VYFLFWAPYNIVLLLNT FQEFFGLNNCS S SNRLDQAMQV 281 


Qy 

1 

Db 

1 

Qy 

31 

Db 

61 

Qy 

65 

Db 

119 

Qy 

79 

Db 

179 

Qy 

106 

Db 

239 


RESULT 10 
US-09-195-662A-2 

; Sequence 2, Application US/09195662A 

; Patent No. US2002007 6745A1 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven, M. 

; TITLE OF INVENTION: Human G-Protein Chemokine Receptor HDGNR10 (CCR5 
Receptor) 

; FILE REFERENCE: 1488.1150002 

; CURRENT APPLICATION NUMBER: US/ 09/195 , 662A 

; CURRENT FILING DATE: 1998-11-18 

; PRIOR APPLICATION NUMBER: 08/466,343 

PRIOR FILING DATE: 1995-06-06 
; NUMBER OF SEQ ID NOS: 9 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 2 
; LENGTH: 352 
TYPE: PRT 

ORGANISM: Artificial Sequence: Genomic 
; FEATURE: 


; OTHER INFORMATION: Deduced Amino Acid Sequence 
US-09-195-662A-2 


Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVS S PI YDINYYT S EPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I I I II I I I : I :: I 

Db 61 LKSMTDI YLLNLAI S DLFFLLTVP FWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 1 1 L LT I D R YLAWHAVFAL KART VT F GWT S VI T WWAVFAS LPGIIFTRSQKEGLH YT C 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I II I I I I I I I I I 1 I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 11 
US-09-339-912A-2 

; Sequence 2, Application US/09339912A 

; Patent No. US2 002009917 6A1 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven, M. 

TITLE OF INVENTION: Antibodies to Human G-Protein Chemokine Receptor 
HDGNR10 

; TITLE OF INVENTION: (CCR5 Receptor) 

; FILE REFERENCE: 1488.1150003 

; CURRENT APPLICATION NUMBER: US/ 09/339 , 9 12A 

; CURRENT FILING DATE: 1999-06-25 

; PRIOR APPLICATION NUMBER: 09/195,662 

; PRIOR FILING DATE: 1998-11-18 

; PRIOR APPLICATION NUMBER: 08/466,343 

; PRIOR FILING DATE: 1995-06-06 

; NUMBER OF SEQ ID NOS : 9 

; SOFTWARE: Patent In version 3.0 

; SEQ ID NO 2 

LENGTH: 352 
; TYPE: PRT 

ORGANISM: Artificial Sequence: Genomic 
; FEATURE: 

; OTHER INFORMATION: Deduced Amino Acid Sequence 
US-09-339-912A-2 


Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 


Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I II I I I I I I I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAWHAVFALKARTWFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I II I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLW1VICYSGILKTLLRCRNEKKRHRAVRL IF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I 1 I I I I II I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 12 
US-09-938-719-5 

; Sequence 5, Application US/09938719 
; Patent No. US2002 0106742A1 
; GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 

PARMENTIER, MARC 
; VASSART, GILBERT 

; LIBERT, FREDERICK 

; TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 

AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Knobbe, Martens, Olson & Bear 

STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
; STATE: CA 

; COUNTRY: U.S.A. 

ZIP: 92660 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 93 8 , 7 19 
; FILING DATE: 24-Aug-2001 

; CLASSIFICATION: <Unknown> 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/626,939 
; FILING DATE: 27-JULY-2000 

ATTORNEY/AGENT INFORMATION: 
; NAME: Altman, Daniel E 

REGISTRATION NUMBER: 34,115 


REFERENCE/ DOCKET NUMBER: <Unknown> 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-938-719-5 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I 1 I I I I I I I I I I I I I I I I II I I I I I I 

Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I I I I I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 1 1 LLT I D RYLAWHAVFAL KART VT FGWT S VI TWWAVFAS LPGIIFTRSQKE G LH YT C 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

II I I I I II I I I I I I I I I I I I : : I 

Db 17 9 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I II I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 13 
US-09-939-226-5 

; Sequence 5, Application US/09939226 
; Patent No. US20020110805A1 
; GENERAL INFORMATION: 
; APPLICANT : SAMSON, MICHEL 

; PARMENTIER, MARC 

VASSART, GILBERT 
; LIBERT, FREDERICK 

; TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 

AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Knobbe, Martens, Olson & Bear 

; STREET: 620 Newport Center Drive 16th Floor 

; CITY: Newport Beach 

STATE: CA 
COUNTRY: U.S.A. 
; ZIP: 92660 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 


OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 939 , 22 6 
FILING DATE: 24-Aug-2001 
CLASSIFICATION : <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/626,939 
FILING DATE: 2000-07-27 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/ DOCKET NUMBER: <Unknown> 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-939-226-5 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 1 MD YQVS S P I YD I NY YT S E PCQKI NVKQ I AA 30 

I I I I I I I I I I I I I I I I I I II I I I I I I 1 I II 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I II I I I 

Db 119 1 1 LLT I DRYIAVVHAVFALKARTVT FGWT SVI TWWAVFASLPGI I FTRSQKEGLH YTC 17 8 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 14 
US-09-938-703-5 

; Sequence 5, Application US/09938703 
; Patent No. US2002011087 0A1 

GENERAL INFORMATION: 
; APPLICANT: SAMSON, MICHEL 

PARMENTIER, MARC 
; VASSART, GILBERT 

LIBERT, FREDERICK 
TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 


AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 938,7 03 
FILING DATE: 24-Aug-2001 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/626,939 
FILING DATE: 2000-07-27 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/ DOCKET NUMBER: <Unknown> 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-938-703-5 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

[DYQVSSPI YDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I 1 I I I I I I I I I I I I I M 


Qy 

1 

Db 

1 

Qy 

31 

Db 

61 

Qy 

65 

Db 

119 

Qy 

79 

Db 

179 

Qy 

106 

Db 

239 


1 1 1 1 1 1 1 1 1 1 1 1 I 


-LCTRSQKEGLHYTC 78 
: I I I I I I I I I I I I 


SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 


I I I I I I I I I I I I I I I I 


RESULT 15 
US-09-502-783A-2 

Sequence 2, Application US/09502783A 
Patent No. US20020132269A1 
GENERAL INFORMATION: 
APPLICANT: Li, Yi 
APPLICANT: Ruben, Steven M. 

TITLE OF INVENTION: Polynucleotides Encoding Human G-Protein Chemokine 
Receptor (CCR5) 

TITLE OF INVENTION: HDGNR10 
FILE REFERENCE: 1488.1150006 

CURRENT APPLICATION NUMBER: US/09/502 , 7 83A 
CURRENT FILING DATE: 2001-08-23 
PRIOR APPLICATION NUMBER: 08/466,343 
PRIOR FILING DATE: 1995-06-06 
NUMBER OF SEQ ID NOS : 9 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 2 
LENGTH: 352 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-502-783A-2 

Query Match 47.6%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 1.7e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I II I II I I I I I I I I I I I I I I I I 
D b 1 MDYQVS S PI YDINYYTSEPCQKINVKQIAARLLPPLYSLVFI FGFVGNMLVILI LINCKR 60 

Q y 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

| | I I I I I I I I I I II I : I : : I 

D b 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 


QY 


6 5 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 1 1 LLT I D R Y LAWHAVFAL KART VT FGWT S VI T WWAVFAS LPGIIFTRS Q KE GLH YT C 178 


Qy 7 9 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

1 1 1 1 1 1 M 1 1 1 1 II I I II 1 1 : : i 

D b 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 2 39 T IMI VYFLFWAP YNI VLLLNTFQEFFGLNNCS S SNRLDQAMQV 2 81 


Search completed: March 5, 2004, 16:33:44 
Job time : 75.537 sees 


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 

Run on: March 5, 2004, 16:15:44 ; Search time 91.5741 Seconds 

(without alignments) 
475.479 Million cell updates/sec 


Title: 

Perfect score: 
Sequence : 


US-10-057-890A-10 
797 

1 MDYQVSSPIYDINYYTSEPC GLNNCSSSNRLDGHQRVHAA 138 


Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 


Searched: 


1017041 seqs, 315518202 residues 


Total number of hits satisfying chosen parameters: 


1017041 


Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


Database : SPTREMBL_25 : * 


1: 


sp archea:* 

2: 


sp bacteria:* 

3 


sp fungi:* 

4 


sp human : * 

5 


sp invertebrate:* 

6 


sp mammal : * 

7 


sp mhc : * 

8 


sp organelle:* 

9 


sp phage:* 

10: 

sp plant:* 

lis 

sp rodent:* 

12 

sp virus:* 

13 

sp vertebrate:* 

14 

sp unclassified:* 

15 

: sp rvirus:* 

16 

: sp bacteriap:* 

17 

: sp archeap:* 


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 


SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 


1 

373 

46. 

8 

352 

6 

018772 

018772 

pan troglod 

2 

370 

46. 

4 

344 

6 

Q9TQR8 

Q9tqr8 

cercocebus 

3 

370 

46. 

4 

344 

6 

077833 

077833 

cercocebus 

4 

369 

46. 

3 

352 

6 

Q95NC5 

Q95nc5 

hylobates s 

5 

369 

46. 

3 

352 

6 

Q9TV50 

Q9tv50 

pan troglod 

6 

367 

46. 

0 

352 

6 

Q95NC3 

Q95nc3 

miopithecus 

7 

367 

46. 

0 

352 

6 

018771 

018771 

pan troglod 

8 

366 

45. 

9 

352 

6 

Q9TSK1 

Q9tskl 

cercopithec 

9 

366 

45. 

9 

352 

6 

Q9TV49 

Q9tv4 9 

cercocebus 

10 

366 

45. 

9 

352 

6 

Q9XT13 

Q9xtl3 

papio anubi 

11 

366 

45. 

9 

352 

6 

Q9TV45 

Q9tv45 

cercopithec 

12 

366 

45. 

9 

352 

6 

Q95NE8 

Q95ne8 

cercopithec 

13 

365 

45. 

8 

352 

6 

Q9XT14 

Q9xtl4 

colobus gue 

14 

365 

45. 

8 

352 

6 

Q95NC6 

Q95nc6 

trachypithe 

15 

365 

45. 

8 

352 

6 

Q95NC8 

Q95nc8 

colobus pol 

16 

364 

45. 

7 

352 

6 

Q9XS99 

Q9xs99 

gorilla gor 

17 

363 

45. 

5 

352 

6 

097975 

097975 

macaca arct 

18 

363 

45. 

5 

352 

6 

Q9MZA2 

Q9mza2 

cercopithec 

19 

362 

45. 

4 

352 

6 

Q95NC1 

Q95ncl 

theropithec 

20 

362 

45. 

4 

352 

6 

Q95NC0 

Q95nc0 

hylobates m 

21 

362 

45. 

4 

352 

6 

Q9TV46 

Q9tv4 6 

cercopithec 

22 

362 

45. 

4 

352 

6 

097962 

097962 

pygathrix a 

23 

361 

45. 

3 

352 

6 

Q95NC7 

Q95nc7 

nasalis lar 

24 

361 

45. 

3 

352 

6 

Q9TV42 

Q9tv42 

cercopithec 

25 

361 

45. 

3 

352 

6 

077776 

077776 

cercocebus 

26 

361 

45. 

3 

352 

6 

Q95NE1 

Q95nel 

cercocebus 

27 

361 

45 

3 

352 

6 

Q9TQX0 

Q9tqx0 

cercopithec 

28 

359 

45 

0 

352 

6 

Q9TV43 

Q9tv43 

cercopithec 

29 

358 

44 

9 

352 

6 

018770 

018770 

pan troglod 

30 

357 

44 

8 

352 

6 

Q9TV47 

Q9tv47 

cercopithec 

31 

357 

44 

8 

352 

6 

Q9BGN5 

Q9bgn5 

cercopithec 

32 

357 

44 

8 

352 

6 

Q9XT12 

Q9xtl2 

cercopithec 

33 

357 

44 

8 

352 

6 

Q9TSQ7 

Q9tsq7 

cercopithec 

34 

356 

44 

.7 

352 

6 

Q95ND2 

Q95nd2 

mandrillus 

35 

356 

44 

.7 

352 

6 

Q95ND1 

Q95ndl 

mandrillus 

36 

355 

44 

.5 

352 

6 

Q9XT76 

Q9xt76 

cercopithec 

37 

355 

44 

.5 

352 

6 

Q9TV44 

Q9tv44 

cercopithec 

38 

355 

44 

.5 

352 

6 

Q9XS35 

Q9xs35 

macaca neme 

39 

355 

44 

.5 

352 

6 

Q9MZA3 

Q9mza3 

hylobates a 

40 

355 

44 

.5 

352 

6 

Q9TV93 

Q9tv93 

macaca arct 

41 

355 

44 

.5 

352 

6 

Q95ND0 

Q95nd0 

erythrocebu 

42 

346 

43 

.4 

352 

6 

Q9BGN6 

Q9bgn6 

cercopithec 

43 

345 

43 

.3 

339 

4 

Q9UN25 

Q9un25 

homo sapien 

44 

344 

43 

.2 

339 

4 

Q9UN23 

Q9un23 

homo sapien 

45 

344 

43 

.2 

339 

4 

Q9UN27 

Q9un27 

homo sapien 


ALIGNMENTS 


RESULT 1 
018772 

ID 018 772 PRELIMINARY; PRT; 352 AA. 

AC 018772; 

DT 01-JAN-1998 (TrEMBLrel. 05, Created) 

DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 


DE CCR5 receptor (Fragment) . 

GN CCR5 . 

OS Pan troglodytes (Chimpanzee) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 

OX NCBI_TaxID=9598; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ChCCR5-142a; 

RA Zhang L., Carruthers CD., He T., Huang Y. , Cao Y. , Wang G. , Hahn B., 

RA Ho D.D. ; 

RT "HIV-l subtypes, co-receptor usage, and CCR5 polymorphism."; 

RL AIDS Res. Hum. Retroviruses 0:0-0(1997). 

DR EMBL; AF011541; AAB65741.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor. 

FT NON_TER 352 352 

SQ SEQUENCE 352 AA; 40598 MW; 3968 8FA7004C952F CRC64 ; 

Query Match 46.8%; Score 373; DB 6; Length 352; 

Best Local Similarity 34.6%; Pred. No. 1.9e-33; 

Matches 98; Conservative 8; Mismatches 27; Indels 150; Gaps 7; 

Q y 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

| I I I I I I I I I i I : I I I I I M I I I 1 I I I I I I 
Db i MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVDNMLVILILINCKR 60 


Qy 


31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

| | I I I I I I I I I I I I I : I : ' I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 


Qy 


65 LCTRSQKEGLHYTC 78 

: II I I I I I I I I I I 


Db H9 IILLT IDRYLAIVliAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 17 8 

Qy 7 9 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I II I I I I I I I : : 1 

SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


Db 179 

Qy 


106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I | I I I I I I I I I M I I I i I : I 

Db 239 TIMIVYFLFWAPYDIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 2 
Q9TQR8 

ID Q9TQR8 PRELIMINARY; PRT; 344 AA. 

AC Q9TQR8; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 


DT 

DT 

DE 

GN 

OS 

OS 

OC 

OC 

OC 

OX 

RN 

RP 

RC 

RA 

RA 

RT 

RT 

RL 

RN 

RP 

RC 

RA 

RA 

RT 

RT 

RL 

RN 

RP 

RC 

RA 

RA 

RT 

RT 

RL 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

SQ 


01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 
01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 
Chemokine receptor CCR5 . 
CCR5. 

Cercocebus torquatus torquatus, and 

Cercocebus torquatus atys (Red-crowned mangabey) (Sooty mangabey) . 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 
Cercopithecinae; Cercocebus. 
NCBI_TaxID=81944 / 9531; 
[1] 

SEQUENCE FROM N.A. 

SPECIES=C. torquatus torquatus; STRAIN=1049, and 997; 

Chen Z., Kwon D., Jin Z., Monard S., Telfer P., Jones M. , Lu C, 

Aguilar R., Ho D.D., Marx P. A.; 

"Natural infection of a homozygous delta 24 CCR5 red-capped mangabey 
with a R2b-tropic simian immunodeficiency virus."; 
J. Exp. Med. 0:0-0(1998) . 
[2] 

SEQUENCE FROM N.A. 
SPECIES=C. torquatus atys; 

Palacios E. , Digilio L., McClure H.M., Chen Z., Marx P. A., 
Goldsmith M.A. , Grant R.M. ; 

"Parallel evolution of CCR5-null phenotypes in humans and in a natural 
host of simian immunodeficiency virus."; 
Curr. Biol. 0:0-0 (1998) . 
[3] 

SEQUENCE FROM N.A. 

SPECIES=C. torquatus torquatus; STRAIN=RCM411; 

Beer B.E., Kuiken C.L., Tooze Z., Foley B.T., Goeken R.M., Brown C.R., 
St Claire M. , Hirsch V.M. ; 

"Characterization of novel simian immunodeficiency viruses from 
redcapped mangabeys from Nigeria ( SIVrcmNgM411 and NgD409) ."; 
Submitted (FEB-2001) to the EMBL/ GenBank/DDBJ databases. 
EMBL; AF094753; AAC62474.1; 
EMBL; AF079473; AAC31194.1 
EMBL; AF084003; AAC62471.1 
EMBL; AF349683; AAK69685.1, 

GO; GO: 0016021; C: integral to membrane; IEA. 
GO; GO: 0004872; F: receptor activity; IEA. 

GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 
GO; GO: 0007186; P:G-protein coupled receptor protein signalin. 
InterPro; IPR000276; GPCR_Rhodpsn . 
Pfam; PF00001; 7tm_l; 1. 
PRINTS; PR00237; GPCRRHODOPSN . 
PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 
PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 
Receptor . 

SEQUENCE 344 AA; 39592 MW; E15F5F601191A4D1 CRC64; 


IEA. 


Query Match 4 6.4%; 

Best Local Similarity 34.9%; 
Matches 96; Conservative 


Score 370; DB 6; Length 344; 
Pred. No. 4e-33; 
9; Mismatches 28; Indels 142; Gaps 


Db 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I II I I I I I I I I : I I I I I I M I I I I I I I M 
1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 


Qy 


31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

| | I I I II II II I II I : I : : I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 


QY 


65 LCTRSQKEGLHYTCSSHFPYSQ 8 6 

: I I I I : I I I I I I I I I I I I I I I 
Db H9 ULLTIDRYLAIVHAVFALKARTVTFGWAVFASLPGIIFTRSQREGLHYTCSSHFPYSQ 17 8 


Qy 87 YQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I : : I 

Db 179 YQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTIMIVYFL 238 

Qy 106 -GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I M I I I I I I I I I I I si 

Db 239 FWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 273 


RESULT 3 
077833 

ID 077833 PRELIMINARY; PRT; 344 AA. 

AC 077833; 

DT 01-NOV-1998 (TrEMBLrel. 08 , Created) 

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Chemokine receptor CCR5 . 

GN CCR5 . 

OS Cercocebus torquatus torquatus . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Cercocebus. 

OX NCBI_TaxID=81944; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=1208, and 009; 

RA Chen Z., Kwon D., Jin Z., Monard S., Telfer P., Jones M. , Lu C, 

RA Aguilar R., Ho D.D., Marx P. A.; 

RT "Natural infection of a homozygous delta 24 CCR5 red-capped mangabey 

RT with a R2b-tropic simian immunodeficiency virus."; 

RL J. Exp. Med. 0:0-0(1998). 

DR EMBL; AF094752; AAC62473.1; -. 

DR EMBL; AF084002; AAC62470.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1__1 ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

SQ SEQUENCE 344 AA; 39578 MW; 0CEC05E47C2F6DE6 CRC64; 

Query Match 46.4%; Score 370; DB 6; Length 344; 

Best Local Similarity 34.9%; Pred. No. 4e-33; 

Matches 96; Conservative 9; Mismatches 28; Indels 142; Gaps 


Qy 


1 MD YQVS S P I YD IN Y YT S EP CQKI NVKQI AA- 


30 


I I I I I I I I I I I : I I I II I I I I I I I I I M I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFVFGFVGNILWLILINCKR 


Qy 


Qy 


60 


31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

| | I I I I I I I I M I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


65 LCTRSQKEGLHYTCSSHFPYSQ 8 6 

: I I I I : I I I I I I I I I I I I I I I 
Db 119 1 1 LLTI DRYLAI VHAVFALKARTVT FGWAVFAS LPGI I FTRSQREGLHYTCS SHFP YSQ 178 


Qy 87 YQFWKNFQTLKI HQRVHGG 105 

I I I II I I I I I I I : : I 

Db 179 YQFWKNFQTLKIVT LGLVLPLLVMVICYSGI LKTLLRCRNEKKRHRAVRLI FTIMIVYFL 238 

Qy 106 -GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I M I I I I I I I I I M si 

Db 239 FWAP YNI VLLLNT FQEFFGLNNCS S SNRLDQAMQV 273 


RESULT 4 
Q95NC5 

ID Q95NC5 PRELIMINARY; PRT; 352 AA. 

AC Q95NC5; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5 . 

OS Hylobates syndactylus (Siamang) ( Symphalangus syndactylus) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates. 

OX NCBI_TaxID=9590; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny . Tl ; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AF177884; AAK43367.1; 

DR GO; GO: 0016021; C:integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40508 MW; F4F64B3AD5AF658A CRC64; 

Query Match 46.3%; Score 369; DB 6; Length 352; 

Best Local Similarity 34.3%; Pred. No. 5.3e-33; 

Matches - 97; Conservative 8; Mismatches 28; Indels 150; Gaps 


Qy 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA- 
II II I I I I I I I : I I M I I I I I I I I II I I I 


30 


Db 


1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Q y 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

| | I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGWTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I M I I - I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMI VYFLFWAP YNI VLLLNTFQEFFGLNNCS S SNRLDQAMQV 281 


RESULT 5 
Q9TV50 

ID Q9TV5 0 PRELIMINARY; PRT; 352 AA. 

AC Q9TV50; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5. 

GN CCR5 . 

OS Pan troglodytes (Chimpanzee) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 

OX NCBI_TaxID=9598 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=1410; 

RX MEDLINE=99335215; PubMed=10408730 ; 

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C., 

RA Diop 0., Rigoulet J . , Barre-Sinoussi F . , Fomsgaard A. ; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=1410; 

RA Mueller-Trutwin M.C., Corbet S., Hansen J., Diop O., 

RA Georges-Courbot M.-C, Barre-Sinoussi F., Fomsgaard A.; 

RL Submitted (NOV-1997) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF035214; AAD44007.1; -. 

DR GO; GO: 0016021; C:integral to membrane; IEA. 

DR GO; GO: 0004872; F:receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G PR0TEIN_RECEP_F1_2 ; 1. 


KW Receptor. 

SQ SEQUENCE 352 AA; 40481 MW; 257 8A0F2C07A4A65 CRC64; 


Query Match 46.3%; Score 369; DB 6; Length 352; 

Best Local Similarity 34.3%; Pred. No. 5.3e-33; 

Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I 1 I I I I I I : I I I I I I II I I I I I I I I I 
D b 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I | I I I I I I I I I I I I I : I : : I 

D b 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 1 1 LLS I DRYLAI VHAVFALKARTVT FGWT S VI TWVVAVFAS LPGI I FT RS QKEGLH YT C 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

D b 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I II I I II I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 6 
Q95NC3 

ID Q95NC3 PRELIMINARY; PRT; 352 AA. 

AC Q95NC3; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5 . 

OS Miopithecus talapoin (Talapoin) ( Cercopithecus talapoin) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Miopithecus. 

OX NCBI_TaxID=36231; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny."; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; AF177886; AAK43369.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G PROTEIN RECEP Fl 1; 1. 


DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 
KW Receptor. 

SQ SEQUENCE 352 AA; 40546 MW; 6464152F3E566AE5 CRC64; 

Query Match 46.0%; Score 367; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 8.9e-33; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
D b 1 MDYQVSSPTYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 


31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

| | I I I II I I I I I I I : : I : = I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC— RLLTGLYFIGFFSGIFF 118 


Qy 65 LCTRSQKEGLHYTC 7 8 

: II I I : I I I I I I I 

Db 119 IILLTIDRYLAIVTiAVFALKARTWFGWTSVITWWAVFASLPGIIFTRSQREGLHYTC 178 


Qy 


79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I II I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


Q y 106 GGSYKCGLC QEFFGLNNCSS SNRLDGHQRV 135 

I I I I II I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 7 
018771 

ID 018771 PRELIMINARY; PRT; 352 AA. 

AC 018771; 

DT 01-JAN-1998 (TrEMBLrel. 05, Created) 

DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CCR5 receptor (Fragment) . 

GN CCR5 . 

OS Pan troglodytes (Chimpanzee) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 

OX NCBI_TaxID=9598 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ChCCR5- 14 la ; 

RA Zhang L., Carruthers CD., He T., Huang Y., Cao Y., Wang G. , Hahn B., 

RA Ho D.D. ; 

RT "HIV-l subtypes, co-receptor usage, and CCR5 polymorphism."; 

RL AIDS Res. Hum. Retroviruses 0:0-0(1997). 

DR EMBL; AF011539; AAB65739.1; 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 


DR PROSITE; PS00237; G_PROTEIN__RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN__RECEP_Fl_2 ; 1. 

KW Receptor. 

FT NONJTER 352 352 

SQ SEQUENCE 352 AA; 4 04 66 MW; 3 F FFAC 7 ABAE 1 D 4 FB CRC64; 

Query Match 46.0%; Score 367; DB 6; Length 352; 

Best Local Similarity 34.3%; Pred. No. 8.9e-33; 

Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7 

Q y 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I : I I I I M I I I I I I I I I I I 
Db i MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 


31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

| | I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISNLFFLLTVPFWAHYAAAQWDFGNTMC QLLTGLYFIGFFSGIFF 118 


Qy 


Qy 


65 LCTRSQKEGLHYTC 7 8 

: Mill I II I I I 


Db H9 1 1 LLT I DRYIA.I VHAVFALKARTVT FGWT S VI TWWAVFAS LPGI I FTRSQKGGLH YT C 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I :: 1 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

| I I I I I I I I I I I I I I I M : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 8 
Q9TSK1 

ID Q9TSK1 PRELIMINARY; PRT; 352 AA. 

AC Q9TSK1; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE G-protein coupled chemokine receptor. 

OS Cercopithecus aethiops (Green monkey) (Grivet) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Cercopithecus. 

OX NCBI_TaxID=9534 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Holtkamp N., Baier M., Werner A.; 

RL Submitted (AUG-1997) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF019379; AAD01639.1; -. 

DR GO; GO: 0016021; C:integral to membrane; IEA. 

DR GO; GO: 0004872; F:receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN__RECEP_Fl_l ; 1. 


DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 
KW Receptor. 

SQ SEQUENCE 352 AA; 40588 MW; 0F1869D9A6668DBB CRC64; 

Query Match 45.9%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.2e-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHH S YKCG 64 

II I I 11 I I I II I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: 1111:1111111 

Db 119 IILLTIDRYLAIVHAVFALKARTVTFGVW 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I II I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVKVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 9 
Q9TV4 9 

ID Q9TV49 PRELIMINARY; PRT; 352 AA. 

AC Q9TV4 9; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5 (C-C chemokine receptor 5) . 

GN CCR5 . 

OS Cercocebus galeritus (Agile mangabey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercocebus. 

OX NCBI_TaxID=9532 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=4; 

RX MEDLINE=99335215; PubMed=104 08730 ; 

RA Muller-Trutwin M.C., Corbet S., Hansen J. , Georges-Courbot M.C., 

RA Diop O., Rigoulet J., Barre-Sinoussi F. , Fomsgaard A. ; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=4; 

RA Mueller-Trutwin M.C., Corbet S., Hansen J., Diop 0., 

RA Georges-Courbot M.-C, Barre-Sinoussi F. , Fomsgaard A.; 


RL Submitted (NOV-1997) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Zhang Y. , Ryder O.A., Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny . " ; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF035215; AAD44008.1; 

DR EMBL; AF177898; AAK43381.1; 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1__2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40479 MW; 5E1504A9B8727 8B2 CRC64; 


IEA. 


Query Match 45.9%; 
Best Local Similarity 33.9%; 
Matches 96; Conservative 


Score 366; DB 6; Length 352; 
Pred. No. 1.2e-32; 
9; Mismatches 28; Indels 150; 


Gaps 


Qy 

Db 

Qy 

Db 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I II : I I I I I I I I I I I I I I I II 
1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I II I I I I I I : I : : I 

61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 

Db 

Qy 

Db 

Qy 

Db 


65 


119 


79 


LCTRSQKEGLHYTC 7 8 

: 1111:1111111 

IILLTIDRYLAIVHAVFALKARTWFGVVTSVITWWAVFASLPGIIFTRSQREGLHYTC 17 8 


-HQRVHGG 105 


SSHFPYSQYQFWKNFQTLKI 

I I I I I I I I I I I I I I I I I I I I : : I 

179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


106 


-GGSYKCGLC- 


135 


QEFFGLNNCSSSNRLDGHQRV 

I I I I I I I I I I I I I I I II I : I 

239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 10 
Q9XT13 

ID Q9XT13 PRELIMINARY; PRT; 352 AA. 

AC Q9XT13; 

DT 01-NOV-1999 (TrEMBLrel. 12, Created) 

DT 01-NOV-1999 (TrEMBLrel. 12, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor 5. 

GN CCR5 . 

OS Papio anubis (Olive baboon) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 


OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Papio. 

OX NCBI_TaxID=9555; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Spearman P.W., Mburu D.N./ Graham B.S.; 

RT "Differential Utilization of CCR5 Molecules from Three East African 

RT Simian Species by the HIV-1 Envelope Glycoprotein."; 

RL Submitted (APR-1999) to the EMBL/GenBank/ DDB J databases. 

DR EMBL; AF141640; AAD32685.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO:0004872; F:receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP__Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN__RECEP_Fl_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40489 MW; 0B47E337C1 1E2E1E CRC64; 

Query Match 45.9%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.2e-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7 

Qy 1 MD YQVS S P I YD I N Y YT S EP CQK I NVKQI AA 30 

I I I I I I I I I II : I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I II I I I I II I I I : I :: I 

Db 61 LKSMTDNYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I : I I I I I i I 

Db 119 IILLTIDRYLAIvliAVFALKARTVTFGV^SVITWWAVFASLPGIIFTRSQREGLHYTC 17 8 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I II I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 11 
Q9TV45 

ID Q9TV4 5 PRELIMINARY; PRT; 352 AA. 

AC Q9TV4 5; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5. 

GN CCR5 . 

OS Cercopithecus nictitans (white-nosed guenon) . 


OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercopithecus . 

OX NCBI_TaxID=3622 8; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=7; 

RX MEDLINE=99335215; PubMed=10408730 ; 

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C., 

RA Diop O., Rigoulet J., Barre-Sinoussi F., Fomsgaard A.; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=7; 

RA Mueller-Trutwin M.C., Corbet S., Hansen J., Diop 0., 

RA Georges-Courbot M.-C, Barre-Sinoussi F. , Fomsgaard A.; 

RL Submitted (NOV-1997) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF035219; AAD44012.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN. 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40658 MW; BC665891ED5661F8 CRC64; 

Query Match 45.9%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.2e-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSS PTYDI DYYTSEPCQKINVKQIAARLLPLLYSLVFI FGFVGNI LWLI LINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II I I I I I I I I I I I I I : I I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I : I I I I I I I 

Db 119 IILLTIDRYLAIVmVFALKARTVTFGVVTSVITWWAVFASLPRIIFTRSQREGLHYT 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVMLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I II I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 12 
Q95NE8 

ID Q95NE8 PRELIMINARY; PRT; 352 AA. 

AC Q95NE8; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5. 

GN CCR5 . 

OS Cercopithecus tantalus. 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercopithecus . 

OX NCBI_TaxID=60712; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99335215; PubMed=10408730 ; 

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C., 

RA Diop 0., Rigoulet J., Barre-Sinoussi F . , Fomsgaard A.; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

DR EMBL; AF081577; AAD45495.1; 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn. 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

SQ SEQUENCE 352 AA; 40576 MW; 0F1869D9B7968DBB CRC64; 

Query Match 45.9%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.2e-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I II I I I I I I I I I I I I I I 
Db 1 MDYQVS S PT YDI DYYT SEPCQKINVKQI AARLLPPLYSLVFI FGFVGNI LWLI LINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

II II I I I I I I I II I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I : I I I I I I I 

Db 119 1 1 L LT I D RYLAI VHAVFAL KART VT FGWT S VI T WWAVFAS L P RI I FT RS QREGLH YT C 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I II I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 13 
Q9XT14 

ID Q9XT14 PRELIMINARY; PRT; 352 AA. 

AC Q9XT14; 

DT 01-NOV-1999 (TrEMBLrel. 12, Created) 

DT 01-NOV-1999 (TrEMBLrel. 12, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor 5. 

GN CCR5 . 

OS Colobus guereza (Black-and-white colobus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae; 

OC Colobus. 

OX NCBI_TaxID=33548; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Spearman P.W., Mburu D.N., Graham B.S.; 

RT "Differential Utilization of CCR5 Molecules from Three East African 

RT Simian Species by the HIV-1 Envelope Glycoprotein."; 

RL Submitted (APR-1999) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF141639; AAD32684.1; 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F : rhodopsin-like receptor activity; IEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40550 MW; 9B07 8EF04D34DB36 CRC64; 

Query Match 45.8%; Score 365; DB 6; Length 352; 

Best Local Similarity 33.6%; Pred. No. 1.5e-32; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7 

MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I 1 I I I I I I I I I : I I I I I I I I I : I I I I I I I 

MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

YKCGLC AAAQWDFGNTMCQHQRVHGHHHH S YKCG 64 

I I I I I I I I I I I I I I I : I : : I 

LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

LCTRSQKEGLHYTC 78 

: I I I I : II I I I I I 

1 1 L LT I D R Y LAI VHAVFAL KART AT F GWT S VI T WWAVFAS LPGIIFTRSQREGLH YT C 17 8 

SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I II I I I I I :s I 

SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 23 8 


I II I I M I I I I I I I I 


Qy 

1 

Db 

1 

Qy 

31 

Db 

61 

Qy 

65 

Db 

119 

Qy 

79 

Db 

179 

Qy 

106 

Db 

239 


RESULT 14 
Q95NC6 

ID Q95NC6 PRELIMINARY; PRT; 352 AA. 

AC Q95NC6; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT Ol-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5 . 

OS Trachypithecus johnii (hooded leaf monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; Colobinae; 

OC Trachypithecus . 

OX NCBI_TaxID=66063; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Ryder O.A., Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny . " ; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF177883; AAK43366.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS 5 02 62; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40462 MW; 52824E0322559F7F CRC64; 


Query Match 45.8%; Score 365; DB 6; Length 352; 

Best Local Similarity 33.6%; Pred. No. 1.5e-32; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; 


Gaps 


Qy 

i 

Db 

i 

Qy 

31 

Db 

61 

Qy 

65 

Db 

119 

Qy 

79 

Db 

179 

Qy 

106 


MDYQVSSPIYDINYYTSEPCQKINVKQIAA 

I t I I I I I I I I I : I I I I I I I I I : I I I I II I 

MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 


30 


60 


YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 


-LCTRSQKEGLHYTC 78 
: I I I I : I I I I I I I 


I I 


I I I 


I I 


I I I 


-HQRVHGG- 
:: I 


105 


I I I I I I I I I I I I I I I I : I 


Db 


239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCS S SNRLDQAMQV 281 


RESULT 15 

Q95NC8 

ID Q95NC8 


PRELIMINARY; 


PRT; 


352 AA. 


AC Q95NC8; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5 . 

OS Colobus polykomos. 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae; 

OC Colobus . 

OX NCBI_TaxID=9572 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y.; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny. " ; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF177881; AAK43364.1; -. 

DR GO; GO: 0016021; Crintegral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40578 MW; 4366F149C3B4938F CRC64 ; 

Query Match 45.8%; Score 365; DB 6; Length 352; 

Best Local Similarity 33.6%; Pred. No. 1.5e-32; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7 
Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 


I I I I I I I i i i i • i i i i i i i i i ♦ i 

1 MD YQVS S P T YD I D Y YT SEP CQKVNVKQ I AARL L P P L YS LVF I FGFVGN I LWL I L I N C KR 60 



Db 


Qy 


Db 


31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I II I I : I : : I 

61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 


Qy 


Db 


65 LCTRSQKEGLHYTC 7 8 

: I II I : II I I I I I 

119 1 1 LLT I DRYLAI VHAVFALKART VT FGWT S VI TWWAVFAS L PGI I FT RSQREGLH YTC 178 


Qy 


7 9 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 



Db 


179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


Qy 


106 


GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 


I I I I I I I I I I I I I I I I I I : I 

Db 239 T IMI VYFLFWAP YNI VLLLNTFQEFFGLNNCS S SNRLDQAMQV 28 


Search completed: March 5, 2004, 16:27:31 
Job time : 93.5741 sees 


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 


Run on: 


Title: 

Perfect score: 
Sequence : 


March 5, 2004, 16:15:14 ; Search time 18.7407 Seconds 

(without alignments) 
383.426 Million cell updates/sec 

US-10-057-890A-10 
797 

1 MDYQVSSPIYDINYYTSEPC GLNNCSSSNRLDGHQRVHAA 138 


Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


141681 


Database 


SwissProt 42:* 


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 


Result 


Query 


No. 

Score 

Match Length 

DB 

ID 

Description 

1 

379 

47. 

6 

352 

1 

CKR5_HUMAN 

P51681 

homo sapien 

2 

374 

46. 

9 

352 

1 

CKR5 PANTR 

P56440 

pan troglod 

3 

369 

46. 

3 

352 

1 

CKR5_HYLSY 

Q95nc5 

hylobates s 

4 

369 

46. 

3 

352 

1 

CKR5 PONPY 

097881 

pongo pygma 

5 

366 

45. 

9 

352 

1 

CKR5 PAPHA 

P56441 

papio hamad 

6 

365 

45. 

8 

352 

1 

CKR5_PYGBI 

097880 

pygathrix b 

7 

365 

45. 

8 

352 

1 

CKR5_PYGNE 

097882 

pygathrix n 

8 

365 

45. 

8 

352 

1 

CKR5_TRAFR 

097878 

trachypithe 

9 

365 

45. 

8 

352 

1 

CKR5 TRAPH 

097879 

trachypithe 

10 

364 

45. 

7 

352 

1 

CKR5_GORGO 

P56439 

gorilla gor 

11 

363 

45. 

5 

352 

1 

CKR5_MACMU 

P79436 

macaca mula 

12 

362 

45. 

4 

352 

1 

CKR5 HYLML 

Q95nc0 

hylobates m 

13 

361 

45. 

3 

352 

1 

CKR5_CERPY 

Q9tv42 

cercopithec 

14 

361 

45. 

3 

352 

1 

CKR5_CERTO 

062743 

cercocebus 

15 

359 

45. 

0 

352 

1 

CKR5_HYLLE 

097883 

hylobates 1 

16 

357 

44. 

8 

352 

1 

CKR5 CERAE 

P56493 

cercopithec 

17 

272 

34. 

1 

354 

1 

CKR5 MOUSE 

P51682 

mus musculu 


18 

249 

31. 

2 

354 

1 

CKR5__RAT 

008556 

rattus norv 

19 

156. 5 

19. 

6 

474 

1 

Z256 HUMAN 

Q9y2p7 

homo 

sapien 

20 

154 

19. 

3 

201 

1 

Z239 MOUSE 

P24399 

mus musculu 

21 

151.5 

19. 

0 

645 

1 

Z235 MOUSE 

Q61116 

mus musculu 

22 

151.5 

19. 

0 

754 

1 

Z287_HUMAN 

Q9hbt7 

homo 

sapien 

23 

150.5 

18 . 

9 

501 

1 

ZF9 6__MOUSE 

Q9zld7 

mus musculu 

24 

146.5 

18. 

4 

458 

1 

Z239 HUMAN 

Q16600 

homo 

sapien 

25 

144.5 

18. 

1 

759 

1 

Z287J40USE 

Q9eqb9 

mus musculu 

26 

144 

18. 

1 

698 

1 

Z234 HUMAN 

Q14588 

homo 

sapien 

27 

140.5 

17. 

6 

604 

1 

Z305 HUMAN 

043309 

homo 

sapien 

28 

140. 5 

17. 

6 

744 

1 

YJ62 HUMAN 

Q8tf39 

homo 

sapien 

29 

140. 5 

17. 

6 

803 

1 

Z226 HUMAN 

Q9nyt6 

homo 

sapien 

30 

140. 5 

17 . 

6 

913 

1 

Z228_HUMAN 

Q9uju3 

homo 

sapien 

31 

139. 5 

17 . 

5 

578 

1 

Z192 HUMAN 

Q15776 

homo 

sapien 

32 

139.5 

17 . 

5 

682 

1 

ZN4 5 HUMAN 

Q02386 

homo 

sapien 

33 

139.5 

17 . 

5 

751 

1 

Z18 4__HUMAN 

Q99676 

homo 

sapien 

34 

139 

17 . 

4 

810 

1 

Z33A HUMAN 

Q06730 

homo 

sapien 

35 

138 . 5 

17 . 

4 

468 

1 

ZF90 HUMAN 

Q8tf47 

homo 

sapien 

36 

138 . 5 

17 . 

4 

670 

1 

ZN16 HUMAN 

P17020 

homo 

sapien 

37 

138 . 5 

17 . 

4 

1029 

1 

Z197 HUMAN 

014709 

homo 

sapien 

38 

138 

17 . 

3 

463 

1 

Z331 HUMAN 

Q9nqx6 

homo 

sapien 

39 

136. 5 

17 . 

1 

348 

1 

Z134 HUMAN 

P52741 

homo 

sapien 

40 

136. 5 

17. 

1 

659 

1 

Z304_HUMAN 

Q9hcx3 

homo 

sapien 

41 

136.5 

17. 

1 

825 

1 

ZF28 MOUSE 

P10078 

mus musculu 

42 

135.5 

17. 

0 

643 

1 

ZN74_HUMAN 

Q16587 

homo 

sapien 

43 

133.5 

16. 

8 

636 

1 

ZF90 MOUSE 

Q61967 

mus musculu 

44 

132.5 

16. 

6 

642 

1 

ZN14 HUMAN 

P17017 

homo 

sapien 

45 

131.5 

16. 

5 

367 

1 

Z211 HUMAN 

Q13398 

homo 

sapien 


ALIGNMENTS 


RESULT 1 

CKR5_HUMAN 

ID CKR5_HUMAN STANDARD; PRT; 352 AA. 

AC P51681; 014692; 014693; 014695; 014696; 014697; 014698; 014699; 

AC 014700; 014701; 014702; 014703; 014704; 014705; O14706; 014707; 

AC 014708; 015538; Q9UPA4; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) 

DE (HIV-1 fusion co-receptor) (CHEMR13) (CD195 antigen) . 

GN CCR5 OR CMKBR5. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96241590; PubMed=8 639485 ; 

RA Samson M. , Labbe O., Mollereau C, Vassart G., Parmentier M. ; 

RT "Molecular cloning and functional expression of a new human 

RT CC-chemokine receptor gene . " ; 

RL Biochemistry 35:3362-3367(1996). 

RN [2] 


RP SEQUENCE FROM N.A. 

RX MEDLI NE= 96291862; PubMed= 8 663314; 

RA Raport C.J., Gosling J., Schweichart V.L., Gray P.W., Charo I.F.; 

RT "Molecular cloning and functional characterization of a novel human 

RT CC chemokine receptor (CCR5) for RANTES , MlP-lbeta, and MlP-lalpha . " ; 

RL J. Biol. Chem. 271:17161-17166(1996). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96295970; PubMed=8699119 ; 

RA Combadiere C, Ahuja S.K., Tiffany H.L., Murphy P.M.; 

RT "Cloning and functional expression of CC CKR5, a human monocyte CC 

RT chemokine receptor selective for MIP-1 ( alpha) , MlP-l(beta), and 

RT RANTES."; 

RL J. Leukoc. Biol. 60:147-152(1996). 

RN [4] 

RP SEQUENCE FROM N.A. 

RA McCorabie W.R., Wilson R., Chen E., Gibbs R. , Zuo L., Johnson D., 

RA Nhan M. , Parnell L., Dedhia N., Ansari A., Mardis E. , Schutz K. , 

RA Gnoj L., la Bastide M. , Kaplan N., Greco T., Touchman J . , 

RA Muzny D., Chen C.N., Evans C. , Fitzgerald M. , See L . H . , Tang M. , 

RA Porcel B .M. , Dragan Y. , Giacalone J., Pae A., Powell E., 

RA Solinsky K.A. , Desilva U., Diaz-Perez S., Zhou X., Yu Y., 

RA Watanabe M. , Doggett N., Garcia D., Sagripanti J.L.; 

RL Submitted (MAY-1997) to the EMBL/ GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98001387; PubMed=9343222 ; 

RA Kuhmann S.E., Piatt E.J., Kozak S.L., Kabat D.; 

RT "Polymorphisms in the CCR5 genes of African green monkeys and mice 

RT implicate specific amino acids in infections by simian and human 

RT immunodeficiency viruses."; 

RL J. Virol. 71:8642-8656(1997). 

RN [6] 

RP SEQUENCE FROM N.A., AND POLYMORPHISMS. 

RX MEDLINE=98022612; PubMed=9359654 ; 

RA Zhang L. , Carruthers CD., He T., Huang Y., Cao Y., Wang G., Hahn B . , 

RA Ho D.D.; 

RT "HIV type 1 subtypes, coreceptor usage, and CCR5 polymorphism."; 

RL AIDS Res. Hum. Retroviruses 13:1357-1366(1997). 

RN [7] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98049523; PubMed=93882 01 ; 

RA Mummidi S., Ahuja S.S., McDaniel B.L., Ahuja S.K.; 

RT "The human CC chemokine receptor 5 (CCR5) gene. Multiple transcripts 

RT with 5 f -end heterogeneity, dual promoter usage, and evidence for 

RT polymorphisms within the regulatory regions and noncoding exons."; 

RL J. Biol. Chem. 272:30662-30671(1997). 

RN [8] 

RP SEQUENCE FROM N.A., AND VARIANT ARG-17 8. 

RA Magierowska M. , Barre-Sinoussi F., Issafras H., Theodorou I., 

RA Debre P. ; 

RL Submitted (MAR-1998) to the EMBL/ GenBank/DDBJ databases. 

RN [9] 

RP SEQUENCE FROM N.A. 

RA Kopatz S.A., Aronstam R.S., Sharma S.V.; 

RT "cDNA clones of human proteins involved in signal transduction 

RT sequenced by the Guthrie cDNA resource center (www.cdna.org)."; 


RL Submitted (JAN-2003) to the EMBL/ GenBank/DDB J databases. 

RN [10] 

RP CHARACTERIZATION OF ITS HIV-1 RECEPTOR FUNCTION. 

RX MEDLINE=96260017; PubMed=8 64 9511 ; 

RA Deng H., Liu R. , Ellmeier W., Choe S. f Unutmaz D., Burkhart M. , 

RA di Marzio P., Marmon S., Sutton R.E., Hill CM., Davis C.B., 

RA Peiper S.C., Schall T.J., Littman D.R., Landau N.R.; 

RT "Identification of a major co-receptor for primary isolates of 

RT HIV-1."; 

RL Nature 381:661-666(1996). 

RN [11] 

RP CHARACTERIZATION OF ITS HIV-1 RECEPTOR FUNCTION. 

RX MEDLINE=96260018; PubMed=864 9512 ; 

RA Dragic T., Litwin V., Allaway G.P., Martin S.R., Huang Y. , 

RA Nagashima K.A., Cayanan C, Maddon P. J., Koup R.A. , Moore J. P., 

RA Paxton W.A. ; 

RT "HIV-1 entry into CD4 + cells is mediated by the chemokine receptor 

RT CC-CKR-5."; 

RL Nature 381:667-673(1996). 

RN [12] 

RP SULFATION. 

RX MEDLINE-99189752; PubMed=10089882 ; 

RA Farzan M., Mirzabekov T., Kolchinsky P., Wyatt R., Cayabyab M. , 

RA Gerard N.P., Gerard C, Sodroski J., Choe H. ; 

RT "Tyrosine sulfation of the amino terminus of CCR5 facilitates HIV-1 

RT entry."; 

RL Cell 96:667-676(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and rantes and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. Acts as co-receptor with CD4 for primary non- 

CC syncytium-inducing strains (NSI) (macrophage-tropic) of HIV-1 

CC virus. It promotes Env-mediated fusion of the virus. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Found in promyelocytic cells. 

CC -!- PTM: Sulfation contributes to the efficiency of HIV-1 entry. 

CC -!- PTM: Modified by O-linked glycosylation, but not by N-linked 

CC glycosylation. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 


CC 



DR 

EMBL; 

X91492; CAA62796.1; -. 

DR 

EMBL; 

U54994; AAC50598.1; -. 

DR 

EMBL; 

U57840; AAB17071.1; -. 

DR 

EMBL; 

U95626; AAB57793.1; 

DR 

EMBL; 

U83326; AAC51797.1; -. 

DR 

EMBL; 

AF011500; AAB65700.1; 

DR 

EMBL; 

AF011501; AAB65701.1; -. 

DR 

EMBL; 

AF011502; AAB65702.1; -. 


DR 

EMBL 

i AF011503, 
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1 


DR 

EMBL 

; AF011505, 
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DR 

EMBL 

\ AF011506, 
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. _ 

DR 

EMBL, 
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DR 
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• AF011508, 
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DR 
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• AF011509, 
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DR 

EMBL, 

• AF011510, 
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DR 

EMBL, 
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DR 

EMBL, 

; AF011512, 

AAB65712. 
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DR 

EMBL, 

• AF011513, 

AAB65713. 
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DR 

EMBL, 

• AF011514, 

AAB65714 . 

1 
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DR 

EMBL, 

f AF011515, 

AAB65715. 
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DR 

EMBL, 

• AF011516, 

AAB65716. 
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DR 

EMBL, 

: AF011517, 

AAB65717. 
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DR 

EMBL, 

■ AF011518, 
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DR 

EMBL 

• AF011519, 

AAB65719. 
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DR 

EMBL 

• AF011520, 
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DR 

EMBL 

? AF011521, 
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DR 

EMBL, 

• AF011522, 

AAB65722. 
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DR 

EMBL, 

• AF011523, 
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DR 

EMBL, 

• AF011524, 
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DR 
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; AF011525, 
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1 

. 

DR 

EMBL 

; AF011526, 

AAB65726. 

1 

, _ 

DR 

EMBL 

• AF011527, 

AAB65727 . 

1 

_ 

DR 

EMBL 

; AF011528, 

AAB65728 . 

1 

• 

DR 

EMBL 

! AF011529, 

AAB65729. 

1 

. _ 

DR 

EMBL 

! AF011530, 

AAB65730. 

1 

> _ 

DR 

EMBL 

\ AF011531, 

AAB65731. 

1 


DR 

EMBL 

; AF011532, 

AAB65732. 

1 

_ 

DR 

EMBL 

; AF011533, 

AAB65733. 

1 

_ 

DR 

EMBL 

; AF011534, 

AAB65734 . 

1 


DR 

EMBL 

; AF011535, 

AAB65735 . 

1 


DR 

EMBL 

; AF011536, 

AAB65736. 

1 


DR 

EMBL 

; AF011537, 

• AAB65737. 

1 


DR 

EMBL 

? AF031237, 

; AAB94735. 

1 


DR 

EMBL 

? AF052539, 

• AAD18131. 

1 


DR 

EMBL 

; AY221093, 

• AA065971. 

1 


DR 

Genew; HGNC:1606; CCR5 . 



DR 

MIM; 

601373; - 





DR GO; GO: 0005768; C : endosome; TAS . 

DR GO; GO: 0005887; C:integral to plasma membrane; TAS. 

DR GO; GO:0016493; F:OC chemokine receptor activity; NAS . 

DR GO; GO: 0015026; F:coreceptor activity; TAS. 

DR GO; GO: 0007267; P: cell-cell signaling; TAS. 

DR GO; GO: 0006968; P:cellular defense response; TAS. 

DR GO; GO: 0006935; P : chemotaxis ; TAS. 

DR GO; GO: 0007204; P:cytosolic calcium ion concentration elevation; TAS. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; TAS 

DR GO; GO: 0006954; P : inflammatory response; TAS. 

DR GO; GO: 0007125; P: invasive growth; TAS. 

DR GO; GO: 0007203; P : phosphatidylinositol-4 , 5-bisphosphate hydro. . .; TAS 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation; 


KW 

Polymorphism. 




FT 

DOMAIN 

1 

30 

EXTRACELLULAR (POTENTIAL) 

FT 

TRANSMEM 

31 

58 

1 (POTENTIAL). 

FT 

DOMAIN 

59 

68 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

69 

89 

2 (POTENTIAL). 

FT 

DOMAIN 

90 

102 

EXTRACELLULAR (POTENTIAL) 

FT 

TRANSMEM 103 

124 

3 (POTENTIAL) . 


Query Match 47.6%; 
Best Local Similarity 35.0%; 
Matches 99; Conservative 


Score 379; DB 1; Length 352; 
Pred. No. 1.8e-29; 
7; Mismatches 27; Indels 150; 


Gaps 


Qy 
Db 

Qy 

Db 


1 MDYQVS S P I YD I N Y YT SEP CQKINVKQIAA 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I 
1 MDYQVS S PI YDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVI LI LINCKR 


31 


61 


30 


60 


-YKCGLC- 


AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I II I I I I : I : : I 

LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 

Db 

Qy 

Db 

Qy 

Db 


65 


119 


79 


LCTRSQKEGLHYTC 7 8 

: I I I II I I I I I I I 

1 1 L L T I D RYLAWHAVFAL KART VT FGWT S VI T WWAVFAS L P G 1 1 FTRSQKEGLHYTC 17 8 


-HQRVHGG- 


SSHFPYSQYQFWKNFQTLKI 

I I I I I I 1 I I I I I I I I I I I I I : : I 

179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 

106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I II I I I : I 

239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


105 


238 


RESULT 2 
CKR5_PANTR 

ID CKR5_PANTR STANDARD; PRT; 352 AA. 

AC P56440; 002778; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Pan troglodytes (Chimpanzee) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 

OX NCBI_TaxID=9598; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97268687; PubMed=9108 0 95 ; 

RA Edinger A.L., Amedee A., Miller K. , Doranz B.J., Endres M. , 

RA Sharron M. , Samson M., Lu Z.H., Clements J.E., Murphey-Corb M. , 

RA Peiper S.C., Parmentier M. , Broder C.C., Doms R.W. ; 

RT "Differential utilization of CCR5 by macrophage and T cell tropic 

RT simian immunodeficiency virus strains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:4005-4010(1997). 

RN [2] 

RP SEQUENCE FROM N.A. 


RA Zimmerman P. A., Buckler-White A., Alkhatib G.; 

RL Submitted (MAR-1997) to the EMBL/GenBank/DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98022612; PubMed=9359654 ; 

RA Zhang L., Carruthers CD., He T., Huang Y., Cao Y. , Wang G. , Hahn B., 

RA Ho D.D. ; 

RT "HIV type 1 subtypes, coreceptor usage, and CCR5 polymorphism."; 

RL AIDS Res. Hum. Retroviruses 13:1357-1366(1997). 

RN [4] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97426118; PubMed=9282822 ; 

RA Zacharova V., Zachar V. , Goustin A.S.; 

RT "Sequence of chemokine receptor gene CCR5 in chimpanzees, a natural 

RT HIV type 1 host."; 

RL AIDS Res. Hum. Retroviruses 13:1159-1161(1997). 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98090115; PubMed=94 3 0250 ; 

RA Pretet J.-L., Zerbib A., Girard M. , Guillet J.-G., Butor C; 

RT "Chimpanzee CXCR4 and CCR5 act as coreceptors for HIV type 1."; 

RL AIDS Res. Hum. Retroviruses 13:1583-1587(1997). 

RN [6] 

RP SEQUENCE FROM N.A. 

RA Zhang Y. , Ryder O.A., Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny."; 

RL Submitted (AUG-1999) to the EMBL/GenBank/DDB J databases. 

CC FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF005663; AAB62557.1; -. 

DR EMBL; U94329; AAB58446.1; 

DR EMBL; AF011542; AAB65742.1; -. 

DR EMBL; U97666; AAC51670.1; 

DR EMBL; AF011540; AAB65740.1; -. 

DR EMBL; U89797; AAC03717.1; -. 

DR EMBL; AF177894; AAK43377.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 


FT 

DOMAIN 

1 

30 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

31 

58 

1 (POTENTIAL). 

FT 

DOMAIN 

59 

68 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

69 

89 

2 (POTENTIAL) . 

FT 

DOMAIN 

90 

102 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

103 

124 

3 (POTENTIAL) . 

FT 

DOMAIN 

125 

141 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

142 

166 

4 (POTENTIAL) . 

FT 

DOMAIN 

167 

198 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

199 

218 

5 (POTENTIAL). 

FT 

DOMAIN 

219 

235 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

236 

260 

6 (POTENTIAL) . 

FT 

DOMAIN 

261 

277 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

278 

301 

7 (POTENTIAL). 

FT 

DOMAIN 

302 

352 

CYTOPLASMIC (POTENTIAL) . 

FT 

DISULFID 

101 

178 

BY SIMILARITY. 

FT 

MOD_RES 

3 

3 

SULFATION (BY SIMILARITY) . 

FT 

MOD__RES 

10 

10 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

14 

14 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

15 

15 

SULFATION (BY SIMILARITY) . 

FT 

CARBOHYD 

268 

268 

N-LINKED (GLCNAC. . .) (PO 

FT 

CONFLICT 

123 

123 

T -> S (IN REF. 1) . 

SQ 

SEQUENCE 

352 AA; 

40539 

MW; 4A33E698B80FE34C CRC64; 


Query Match 4 6.9%; 

Best Local Similarity 34.6%; 
Matches 98; Conservative 


Score 374; DB 1; Length 352; 
Pred. No. 5.6e-29; 

); Mismatches 27; Indels 150; 


Gaps 


Qy 

Db 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I I : I I I I II I I I I I I I I I I I 
1 MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 

Db 


31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 

Db 

Qy 

Db 


65 LCTRSQKEGLHYTC 7 8 

: I I I I I II I I I I I 

119 1 1 LLT I DRYLAI VHAVFALKARTVT FGWT S VI TWWAVFAS L PGI I FT RS QKEGLH YTC 178 


-HQRVHGG- 


105 


7 9 SSHFPYSQYQFWKNFQTLKI 

I I I I I I I I I I I I I I I I I I I I : : I 

179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


Qy 


Db 


106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 3 
CKR5_HYLSY 

ID CKR5_HYLSY STANDARD; PRT; 352 AA. 

AC Q95NC5; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 


GN 

CCR5 OR CMKBR5. 


OS 

Hylobates 

syndactylus (Siamang) ( Symphalangus syndactylus ) . 

OC 

Eukaryota; Metazoa 

; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC 

Mammalia; 

Eutheria 

; Primates; Catarrhini; Hylobatidae; Hylobates. 

OX 

NCBI TaxID=9590; 


RN 

[1] 



RP 

SEQUENCE 

FROM N.A. 


RX 

MEDLINE=99416438; 

PubMed=10486970; 

RA 

Zhang Y . - 

W., Ryder 

O.A. , Zhang Y. -P. ; 

RT 

"Sequence 

evolution of the CCR5 chemokine receptor gene in primates."; 

RL 

Mol. Biol 

. Evol. 16:1145-1154(1999). 

CC 

-!- FUNCTION: Receptor for a C-C type chemokine. Binds to MI P-l -alpha, 

CC 

MIP-1 

-beta and 

RANTES and subsequently transduces a signal by 

CC 

increasing the 

intracellular calcium ions level. May play a role 

CC 

in the control 

of granulocytic lineage proliferation or 

CC 

differentiation. 

CC 

-!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC 
CC 
CC 

-!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC 

between 

the Swiss 

Institute of Bioinf ormatics and the EMBL outstation - 

CC 

the European Bioinf ormatics Institute. There are no restrictions on its 

CC 

use by 

non-profit institutions as long as its content is in no way 

CC 

modified 

and this 

statement is not removed. Usage by and for commercial 

cc 

entities 

requires 

a license agreement (See http://www.isb-sib.ch/announce/ 

cc 
cc 

DR 

or send an email to license@isb-sib . ch) . 

EMBL; AF177884; AAK43367.1; -. 

DR 

InterPro; 

IPR000276; GPCR Rhodpsn. 

DR 

Pfam; PF00001; 7tm 

l; 1. 

DR 

PRINTS; PR00237; GPCRRHODOPSN . 

DR 

PROSITE; 

PS00237; 

G PROTEIN RECEP Fl 1; 1. 

DR 

PROSITE; 

PS50262; 

G_PROTEIN_RECEP_Fl_2 ; 1 . 

KW 

G-protein 

coupled 

receptor; Transmembrane; Glycoprotein; Sulfation. 

FT 

DOMAIN 

1 

30 EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

31 

58 1 (POTENTIAL) . 

FT 

DOMAIN 

59 

68 CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

69 

8 9 2 (POTENTIAL) . 

FT 

DOMAIN 

90 

102 EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

103 

124 3 (POTENTIAL) . 

FT 

DOMAIN 

125 

141 CYTOPLASMIC (POTENTIAL). 

FT 

TRANSMEM 

142 

166 4 (POTENTIAL) . 

FT 

DOMAIN 

167 

198 EXTRACELLULAR (POTENTIAL). 

FT 

TRANSMEM 

199 

218 5 (POTENTIAL). 

FT 

DOMAIN 

219 

235 CYTOPLASMIC (POTENTIAL). 

FT 

TRANSMEM 

236 

260 6 (POTENTIAL) . 

FT 

DOMAIN 

261 

277 EXTRACELLULAR (POTENTIAL). 

FT 

TRANSMEM 

278 

301 7 (POTENTIAL) . 

FT 

DOMAIN 

302 

352 CYTOPLASMIC (POTENTIAL). 

FT 

DISULFID 

101 

178 BY SIMILARITY. 

FT 

MOD__RES 

3 

3 SULFATION (BY SIMILARITY). 

FT 

MOD_RES 

10 

10 SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

14 

14 SULFATION (BY SIMILARITY) . 

SQ 

SEQUENCE 

352 AA; 

40508 MW; F4F64B3AD5AF658A CRC64; 


Query Match 46.3%; Score 369; DB 1; Length 352; 

Best Local Similarity 34.3%; Pred. No. 1.7e-28; 


Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 


Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I 1 I I I I I I I II : I I I I I I II I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC -AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMT DI YLLNLAI S DLFFLLTVP FWAH YAAAQWD FGNTMC — QLLTGL YFI GFFSGI FF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAIVHAVFALKARTW 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I 1 I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVKVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I II I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 4 
CKR5_PONPY 

ID CKR5_PONPY STANDARD; PRT; 352 AA. 

AC 097881; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Pongo pygmaeus (Orangutan) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pongo. 

OX NCBI_TaxID=9600; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970 ; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates. 11 ; 

RL Mol. Biol. Evol. 16:114 5-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY : Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 


cc 

DR 

EMBL; AF075446; AAD19858.1; 


DR 

InterPro; 

IPR000276; GPCR Rhodpsn. 

DR 

Pfam; PF00001; 7tm 1; 1. 


DR 

PRINTS; PR00237; GPCRRHODOPSN . 

DR 

PROSITE; 

PS00237; 

G_PROTEIN_ 

RECEP_F1_1; 1. 

DR 

PROSITE; 

PS50262; 

G_PROTEIN_ 

RECEP_F1_2 ; 1 . 

KW 

G-protein 

coupled 

receptor; 

Transmembrane; Glycoprotein 

FT 

DOMAIN 

1 

30 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

31 

58 

1 (POTENTIAL). 

FT 

DOMAIN 

59 

68 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

69 

89 

2 (POTENTIAL) . 

FT 

DOMAIN 

90 

102 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

103 

124 

3 (POTENTIAL). 

FT 

DOMAIN 

125 

141 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

142 

166 

4 (POTENTIAL) . 

FT 

DOMAIN 

167 

198 

EXTRACELLULAR (POTENTIAL). 

FT 

TRANSMEM 

199 

218 

5 (POTENTIAL) . 

FT 

DOMAIN 

219 

235 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

236 

260 

6 (POTENTIAL) . 

FT 

DOMAIN 

261 

277 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

278 

301 

7 (POTENTIAL) . 

FT 

DOMAIN 

302 

352 

CYTOPLASMIC (POTENTIAL) . 

FT 

DISULFID 

101 

178 

BY SIMILARITY. 

FT 

MOD RES 

3 

3 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

10 

10 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

14 

14 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

15 

15 

SULFATION (BY SIMILARITY) . 

SQ 

SEQUENCE 

352 AA; 

40527 MW 

; F4E2F47135AF658A CRC64; 


Sulfation « 


Query Match 46.3%; 
Best Local Similarity 34.3%; 
Matches 97; Conservative 


Score 369; DB 1; Length 352; 
Pred. No. 1.7e-28; 
8; Mismatches 28; Indels 150; 


Qy 

Db 


Gaps 


7; 


1 MDYQVS S P I YD I NYYT S E P CQKI NVKQI AA 30 

I I 1 I I I I I I I I : I I I I I I I I I I I I I I I II 
1 MDYQVS S PT YDI DYYT S EPCQKINVKQI AARLLP PLYSLVFI FGFVGNMLVI LI LINCKR 60 


Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 


31 


61 


65 


119 


79 


106 


239 


YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I 1111111111111:1: : I 

LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I II I 

IILLTIDRYI^IVliAVFALKARTWFGv^SVITWWAVFASLPGIIFTRSQKEGLHYTC 17 8 


SSHFPYSQYQFWKNFQTLKI HQRVHGG 

I I I I I I I I I I I I I I I I I II I : : I 

179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 


105 


238 


GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 

I I I I I I I I I II I I I I I I I : I 

TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 


135 


281 


RESULT 5 
CKR5 PAPHA 


ID CKR5_PAPHA STANDARD; PRT; 352 AA. 

AC P56441; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5). 

GN CCR5 OR CMKBR5. 

OS Papio hamadryas (Hamadryas baboon) , and 

OS Papio anubis (Olive baboon) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Papio. 

OX NCBI_TaxID=9557, 9555; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC SPECIES=P . hamadryas ; 

RX MEDLINE=97268687; PubMed=9108095 ; 

RA Edinger A.L., Amedee A., Miller K., Doranz B.J., Endres M. , 

RA Sharron M. , Samson M., Lu Z.H., Clements J.E., Murphey-Corb M. , 

RA Peiper S.C., Parmentier M. , Broder C.C., Doms R.W. ; 

RT "Differential utilization of CCR5 by macrophage and T cell tropic 

RT simian immunodeficiency virus strains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:4 005-4 010(1997). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC SPECIES=P . hamadryas ; 

RX MEDLINE=99210133; PubMed=101957 58 ; 

RA Saksena N.K., Wang B., Novembre F.J., Bolton W. , Smit T.K., Lai R.B.; 

RT "Species-specific changes in the CCR5 gene from African and Asian 

RT nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:479-483(1999). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC SPECIES=P . anubis ; 

RA Benton P. A., Timanus D.K., Shearer M.H., Lee D.R., Kennedy R.C.; 

RL Submitted (NOV-1997) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF005658; AAB62552.1; -. 

DR EMBL; AF105287; AAD20556.1; -. 

DR EMBL; AF105288; AAD20557.1; -. 

DR EMBL; AF105289; AAD20558.1; -. 

DR EMBL; AF105290; AAD20559.1; -. 


DR EMBL; AF023452; AAC63830.1; 
DR InterPro; IPR000276; GPCR_Rhodpsn . 
DR Pfam; PF00001; 7tm__l; 1. 
DR PRINTS; PR00237; GPCRRHODOPSN . 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 

Query Match 45.9%; Score 366; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 3.3e-28; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I II I I I : I I I I I I I I I II I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLN LAIS DLL FLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I : I I I I I I I 

Db 119 IILLTIDRYIAIVIIAVFALKARTWFGWTSVITWVVAVFASLPGIIFTRSQREGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I II I I I I I I I I I I II I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


PROSITE; 

PS00237; 

G PROTEIN 

RECEP_F1__1; 1. 

PROSITE; 

PS50262; 

G_PROTEIN_ 

RECEP_F1_2; 1. 

G-protein 

coupled 

receptor; 

Transmembrane; Glycoprotein 

DOMAIN 

1 

30 

EXTRACELLULAR (POTENTIAL) . 

TRANS MEM 

31 

58 

1 (POTENTIAL) . 

DOMAIN 

59 

68 

CYTOPLASMIC (POTENTIAL) . 

TRANSMEM 

69 

89 

2 (POTENTIAL). 

DOMAIN 

90 

102 

EXTRACELLULAR (POTENTIAL) . 

TRANSMEM 

103 

124 

3 (POTENTIAL). 

DOMAIN 

125 

141 

CYTOPLASMIC (POTENTIAL) . 

TRANSMEM 

142 

166 

4 (POTENTIAL) . 

DOMAIN 

167 

198 

EXTRACELLULAR (POTENTIAL) . 

TRANSMEM 

199 

218 

5 (POTENTIAL) . 

DOMAIN 

219 

235 

CYTOPLASMIC (POTENTIAL). 

TRANSMEM 

236 

260 

6 (POTENTIAL) . 

DOMAIN 

261 

277 

EXTRACELLULAR (POTENTIAL) . 

TRANSMEM 

278 

301 

7 (POTENTIAL). 

DOMAIN 

302 

352 

CYTOPLASMIC (POTENTIAL) . 

DISULFID 

101 

178 

BY SIMILARITY. 

MOD RES 

3 

3 

SULFATION (BY SIMILARITY) . 

MOD_RES 

10 

10 

SULFATION (BY SIMILARITY) . 

MOD RES 

14 

14 

SULFATION (BY SIMILARITY) . 

MOD_RES 

15 

15 

SULFATION (BY SIMILARITY) . 

CARBOHYD 

268 

268 

N-LINKED (GLCNAC. . .) (PO 1 ] 

SEQUENCE 

352 AA; 

40489 MW 

; 5E1504A9BA1FE8B2 CRC64; 


RESULT 6 
CKR5 PYGBI 


ID CKR5_PYGBI STANDARD; PRT; 352 AA. 

AC 097880; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5). 

GN CCR5 OR CMKBR5. 

OS Pygathrix bieti (Black snub-nosed monkey) (Rhinopithecus bieti) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae; 

OC Pygathrix. 

OX NCBI_TaxID=61621; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates . " ; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 


CC 
DR 

EMBL; AF075445; AAD19857.1; 


DR 

InterPro; 

IPR000276; GPCR Rhodpsn. 

DR 

Pfam; PF00001; 7tm 1; 1. 


DR 

PRINTS; PR00237; 

GPCRRHODOPSN. 

DR 

PROSITE; 

PS00237; 

G_PR0TEIN 

_RECEP_F1_1 ; 1 . 

DR 

PROSITE; 

PS50262; 

G_PROTEIN~ 

_RECEP_F1_2 ; 1 . 

KW 

G-protein 

coupled 

receptor; 

Transmembrane; Glycoprotein 

FT 

DOMAIN 

1 

30 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

31 

58 

1 (POTENTIAL) . 

FT 

DOMAIN 

59 

68 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

69 

89 

2 (POTENTIAL). 

FT 

DOMAIN 

90 

102 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

103 

124 

3 (POTENTIAL) . 

FT 

DOMAIN 

125 

141 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

142 

166 

4 (POTENTIAL). 

FT 

DOMAIN 

167 

198 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

199 

218 

5 (POTENTIAL) . 

FT 

DOMAIN 

219 

235 

CYTOPLASMIC (POTENTIAL). 

FT 

TRANSMEM 

236 

260 

6 (POTENTIAL) . 

FT 

DOMAIN 

261 

277 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

278 

301 

7 (POTENTIAL) . 

FT 

DOMAIN 

302 

352 

CYTOPLASMIC (POTENTIAL) . 

FT 

DISULFID 

101 

178 

BY SIMILARITY. 


FT 

MOD RES 

3 

3 

SULFATION (BY SIMILARITY) 

FT 

MOD_RES 

10 

10 

SULFATION (BY SIMILARITY) 

FT 

MOD_RES 

14 

14 

SULFATION (BY SIMILARITY) 

FT 

MOD_RES 

15 

15 

SULFATION (BY SIMILARITY) 

SQ 

SEQUENCE 

352 AA; 

40585 

MW; 4366F142730F938F CRC64; 


Query Match 45.8%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.1e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; 


Gaps 


Qy 

1 

Db 

1 

Qy 

31 

Db 

61 

Qy 

65 

Db 

119 

Qy 

79 

Db 

179 

Qy 

106 

Db 

239 


MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I II II I I I : I I I I I I I II : I I I I I I I 

MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I II I I I I I : I : : I 

LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

LCTRSQKEGLHYTC 78 

: 1111:1111111 

1 1 LLT I DRYLAI VHAVFALFCART VT FGWT S VI TWWAVFAS LPGI I FTRS QREGLH YTC 178 

SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I II I I I I I I I I I I I : : I 

SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVI CYSGI LKTLLRCRNEKKRHRAVRLI F 2 3 8 


-GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I II I I I I I : I 


RESULT 7 
CKR5_PYGNE 

ID CKR5_PYGNE STANDARD; PRT; 352 AA. 

AC 097882; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Pygathrix nemaeus (Dove langur) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; Colobinae; 

OC Pygathrix. 

OX NCBI_TaxID=54133; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=994 16438; PubMed=10486970 ; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 


cc 
cc 
cc 

SIMILARITY: Belongs to 

family 1 of G-protein coupled receptors. 

This SWISS-PROT entry is copyright. It is 

produced through a collaboration 

cc 

between 

the Swiss 

Institute of Bioinf ormatics and the EMBL outstation - 

cc 

the European Bioinf ormatics 

Institute. There are no restrictions on its 

cc 

use by 

non-profit institutions as long 

as its content is in no way 

cc 

modified 

and this 

statement 

is not removed 

Usage by and for commercial 

cc 

entities 

requires 

a license 

agreement (See 

http: / /www.isb-sib. ch/announce/ 

cc 
cc 

DR 

or send an email to license@isb-sib . ch) . 


EMBL; AF075448; AAD19860.1; 



DR 

InterPro; 

IPR000276; GPCR Rhodpsn. 


DR 

Pfam; PF00001; 7tm 1; 1. 



DR 

PRINTS; PR00237; GPCRRHODOPSN . 


DR 

PROSITE; 

PS00237; 

G_PROTEIN 

_RECEP_F1 1; 1. 


DR 

PROSITE; 

PS50262; 

G_PROTEIN 

_RECEP_F1_2; 1. 


KW 

G-protein 

coupled 

receptor; 

Transmembrane ; 

Glycoprotein; Sulfation. 

FT 

DOMAIN 

1 

30 

EXTRACELLULAR 

(POTENTIAL) . 

FT 

TRANSMEM 

31 

58 

1 (POTENTIAL) 


FT 

DOMAIN 

59 

68 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

69 

89 

2 (POTENTIAL) 


FT 

DOMAIN 

90 

102 

EXTRACELLULAR 

(POTENTIAL) . 

FT 

TRANSMEM 

103 

124 

3 (POTENTIAL) 


FT 

DOMAIN 

125 

141 

CYTOPLASMIC ( POTENTIAL) . 

FT 

TRANSMEM 

142 

166 

4 (POTENTIAL) 


FT 

DOMAIN 

167 

198 

EXTRACELLULAR 

(POTENTIAL) . 

FT 

TRANSMEM 

199 

218 

5 (POTENTIAL) 


FT 

DOMAIN 

219 

235 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

236 

260 

6 (POTENTIAL) 


FT 

DOMAIN 

261 

277 

EXTRACELLULAR 

(POTENTIAL) . 

FT 

TRANSMEM 

278 

301 

7 (POTENTIAL) 


FT 

DOMAIN 

302 

352 

CYTOPLASMIC (POTENTIAL) . 

FT 

DISULFID 

101 

178 

BY SIMILARITY 


FT 

MOD__RES 

3 

3 

SULFATION (BY 

SIMILARITY) . 

FT 

MOD_RES 

10 

10 

SULFATION (BY 

SIMILARITY) . 

FT 

MOD RES 

14 

14 

SULFATION (BY 

SIMILARITY) . 

FT 

MOD_RES 

15 

15 

SULFATION (BY 

SIMILARITY) . 

SQ 

SEQUENCE 

352 AA; 

40532 MW; FE4F9D98D3B3E8 61 CRC64; 


Query Match 45.8%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.1e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I II I I I I I : I I I I I M 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

QY 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I II I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I : I I I I II I 

Db 119 IILLTIDRYLAIVliAVFALKARTWFGWTSVITWWAVFASLPGIIFTRSQREGLHYTC 178 


79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I II I I I I I I I I : : | 


Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLIMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 

RESULT 8 
CKR5_TRAFR 

ID CKR5_TRAFR STANDARD; PRT; 352 AA. 

AC 097878; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5). 

GN CCR5 OR CMKBR5. 

OS Trachypithecus francoisi (Francois' langur) (Indochinese langur) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; Colobinae; 

OC Trachypithecus . 

OX NCBI_TaxID=54180; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=104 8 6970 ; 

RA Zhang Y.-W., Ryder O.A. , Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF075442; AAD19854.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm 1; 1. 


DR 

PRINTS; PR00237; 

GPCRRHODOPSN. 

DR 

PROSITE; 

PS00237; 

G_PROTEIN 

RECEP Fl 1; 1. 

DR 

PROSITE; 

PS50262; 

G_PR0TEIN_ 

~RECEP_F1_2; 1. 

KW 

G-protein 

coupled 

receptor; 

Transmembrane; Glycoprotei 

FT 

DOMAIN 

1 

30 

EXTRACELLULAR (POTENTIAL) 

FT 

TRANSMEM 

31 

58 

1 (POTENTIAL) . 

FT 

DOMAIN 

59 

68 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

69 

89 

2 (POTENTIAL) . 

FT 

DOMAIN 

90 

102 

EXTRACELLULAR (POTENTIAL) 

FT 

TRANSMEM 

103 

124 

3 (POTENTIAL) . 

FT 

DOMAIN 

125 

141 

CYTOPLASMIC (POTENTIAL) . 


FT 

TRANSMEM 

142 

166 

4 (POTENTIAL). 

FT 

DOMAIN 

167 

198 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

199 

218 

5 (POTENTIAL). 

FT 

DOMAIN 

219 

235 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

236 

260 

6 (POTENTIAL). 

FT 

DOMAIN 

261 

277 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

278 

301 

7 (POTENTIAL). 

FT 

DOMAIN 

302 

352 

CYTOPLASMIC (POTENTIAL) . 

FT 

DISULFID 

101 

178 

BY SIMILARITY. 

FT 

MOD RES 

3 

3 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

10 

10 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

14 

14 

SULFATION (BY SIMILARITY) . 

FT 

MOD RES 

15 

15 

SULFATION (BY SIMILARITY) . 

SQ 

SEQUENCE 

352 AA; 

40496 

MW; 4366F148C255938F CRC64; 


Query Match 45.8%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.1e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I II : I I I I I I I I I : I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

QY 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: 1111:1111111 

Db 119 IILLTIDRYLAIVliAVFALKARTVTFGV^SVITWVVAVFASLPGIIFTRSQREGLHYTC 178 

Qy 7 9 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I II I II I I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 135 

I I I I I I I I I I II I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 9 
CKR5_TRAPH 

ID CKR5_TRAPH STANDARD; PRT; 352 AA. 

AC 097879; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Trachypithecus phayrei (Phayre f s leaf monkey). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; Colobinae; 

OC Trachypithecus. 

OX NCBI_TaxID=61618; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970; 


RA 

Zhang Y.- 

W., Ryder O.A. , Zhang Y.-P.; 

RT 

"Sequence 

evolution 

of the 

CCR5 chemokine receptor gene in primates."; 

RL 

Mol . Biol 

. Evol. 16 

:1145-1154 (1999) . 

CC 

-!- FUNCTION: Receptor for 

a C-C type chemokine. Binds to MIP-1-alpha, 

CC 

MIP-1 

-beta and 

RANTES and subsequently transduces a signal by 

CC 

increasing the 

intracellular calcium ions level. May play a role 

cc 

in the control of granulocytic lineage proliferation or 

cc 

differentiation 



cc 

-!- SUBCELLULAR LOCATION: Integral membrane protein. 

cc 
cc 
cc 

-!- SIMILARITY: Belongs to 

family 1 of G-protein coupled receptors. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 

cc 

between 

the Swiss 

Institute of Bioinf ormatics and the EMBL outstation - 

cc 

the European Bioinf ormatics 

Institute. There are no restrictions on its 

cc 

use by 

non-profit 

institutions as long as its content is in no way 

cc 

modified 

and this statement 

is not removed. Usage by and for commercial 

cc 

entities 

requires a 

license 

agreement (See http://www.isb-sib.ch/announce/ 

cc 
cc 

DR 

or send an email to 

license@isb-sib . ch) . 

EMBL; AF075443; AAD19855.1; 


DR 

InterPro ; 

IPR000276; GPCR_Rhodpsn . 

DR 

Pfam; PF00001; 7tm 

1; 1. 


DR 

PRINTS; PRO 02 37; GPCRRHODOPSN . 

DR 

PROSITE; 

PS00237; G 

_PROTEIN 

_RECEP_F1_1; 1. 

DR 

PROSITE; 

PS50262; G 

PROTEIN 

_RECEP__F1_2; 1. 

KW 

G-protein 

coupled receptor; 

Transmembrane; Glycoprotein; Sulfation. 
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FT 

DISULFID 
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178 

BY SIMILARITY. 

FT 

MOD_RES 

3 

3 

SULFATION (BY SIMILARITY) . 

FT 

MOD RES 

10 

10 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

14 

14 

SULFATION (BY SIMILARITY) . 

FT 

MOD RES 

15 

15 

SULFATION (BY SIMILARITY) . 

SQ 

SEQUENCE 

352 AA; 

40509 MW; 4366F148D3A5938F CRC64; 


Query Match 45.8%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.1e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I II I I I I I : I I I I I I I I I : I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 


QY 


31 


YKCGLO 


AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 


i i i I I l i I i i I I i i i . i ; : | 

Db 61 LKSMTDI YLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I II I : I I I I I I I 

Db 119 1 1 LLT I D R YLAJ VHAVFALKART VT FGWT S VI T WWAVFAS L P G 1 1 FT RS Q REGLH YT C 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG— 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 10 
CKR5_GORGO 

ID CKR5_GORGO STANDARD; PRT; 352 AA. 

AC P56439; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Gorilla gorilla gorilla (Lowland gorilla) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Gorilla. 

OX NCBI_TaxID=9595; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97268687; PubMed=9108095 ; 

RA Edinger A.L., Amedee A., Miller K., Doranz B.J., Endres M. , 

RA Sharron M., Samson M. , Lu Z.H., Clements J.E., Murphey-Corb M. , 

RA Peiper S.C., Parmentier M. , Broder C.C., Doms R.W. ; 

RT "Differential utilization of CCR5 by macrophage and T cell tropic 

RT simian immunodeficiency virus strains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:4005-4 010(1997). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF005659; AAB62553.1; -. 

DR InterPro; IPR000276; GPCRJRhodpsn . 

DR Pfam; PF00001; 7tm 1; 1. 


DR 

PRINTS; PR00237; GPCRRHODOPSN . 

DR 

PROSITE; 

PS00237; 

G PROTEIN 

RECEP Fl 1; 1. 

DR 

PROSITE; 

PS50262; 

G_PROTEIN_ 

"RECEP~F1~2 ; 1 . 

KW 

G-protein 

coupled 

receptor ; 

Transmembrane; Glycoprotein 
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FT 

DISULFID 

101 

178 

BY SIMILARITY. 

FT 

MOD RES 

3 

3 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

10 

10 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

14 

14 

SULFATION (BY SIMILARITY) . 

FT 

MOD RES 

15 

15 

SULFATION (BY SIMILARITY) . 

SQ 

SEQUENCE 

352 AA; 

40515 MW 

; D0E6FCB9FE5EAC84 CRC64; 


Query Match 45.7%; Score 364; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 5.1e-28; 

Matches 96; Conservative 8; Mismatches 29; Indels 150; Gaps 7 

Qy 1 MD YQVS S P I YDI N YYT SEP CQKI NVKQI AA 30 

I I I I I I I I 111:111111111 I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKTNVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 7 8 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAIVmVFALKARTVTFGVA^SVITWWAVFASLPGIIFTRSQKEGLHYTC 17 8 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I :: I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 11 
CKR5_MACMU 

ID CKR5_MACMU STANDARD; PRT; 352 AA. 

AC P79436; 002746; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 


DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) * 

GN CCR5 OR CMKBR5 . 

OS Macaca mulatta (Rhesus macaque), 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) , and 

OS Macaca nemestrina (Pig-tailed macaque) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9544, 9541, 9545; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC SPECIES=M. mulatta; 

RX MEDLINE=97184592; PubMed=9 032394 ; 

RA Marcon L., Choe H., Martin K.A., Farzan M. , Ponath P.D., Wu L., 

RA Newman W. , Gerard N., Gerard C, Sodroski J.; 

RT "Utilization of C-C chemokine receptor 5 by the envelope 

RT glycoproteins of a pathogenic simian immunodeficiency virus, 

RT SIVmac2 39."; 

RL J. Virol. 71:2522-2527(1997). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC SPECIES=M. mulatta; STRAIN=Indian macaque; 

RX MEDLINE=97213934; PubMed-9 060623 ; 

RA Chen Z., Zhou P., Ho D.D., Landau N.R., Marx P. A.; 

RT "Genetically divergent strains of simian immunodeficiency virus use 

RT CCR5 as a coreceptor for entry."; 

RL J. Virol. 71:2705-2714(1997). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC SPECIES=M. mulatta; 

RX MEDLINE=21354176; PubMed=l 14 61684 ; 

RA Margulies B.J., Hauer D.A. , Clements J.E.; 

RT "Identification and comparison of eleven rhesus macaque chemokine 

RT receptors."; 

RL AIDS Res. Hum. Retroviruses 17:981-986(2001). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC SPECIES=M. mulatta, M . fascicularis , and M. nemestrina ; 

RX MEDLINE=97268687; PubMed=9108095 ; 

RA Edinger A.L., Amedee A., Miller K., Doranz B.J., Endres M. , 

RA Sharron M. , Samson M. , Lu Z.H., Clements J.E., Murphey-Corb M., 

RA Peiper S.C., Parmentier M. , Broder C.C., Doms R.W.; 

RT "Differential utilization of CCR5 by macrophage and T cell tropic 

RT simian immunodeficiency virus strains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:4005-4 010(1997). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 


cc 

use by 

non-profit institutions as long as its content is in no way 

cc 

modified 

and this 

statement 

is not removed. Usage by and for commercial 

cc 

entities 

requires 

a license 

agreement (See http://www.isb-sib.ch/announce/ 

cc 
cc 

DR 

or send an email to license@isb-sib.ch). 

EMBL; U77672; AAC51109.1; 


DR 

EMBL; U73739; AAC51158.1; -. 


DR 

EMBL; U96762; AAC34132.1; 


DR 

EMBL; AF005660; AAB62554.1; 


DR 

EMBL; AF005661; AAB62555.1; 


DR 

EMBL; AF005662; AAB62556.1; 


DR 

InterPro; 

IPR000276; GPCR Rhodpsn. 

DR 

Pfam; PF00001; 7tm 1; 1. 


DR 

PRINTS; PR00237; GPCRRHODOPSN . 

DR 

PROSITE; 

PS00237 ; 

G PROTEIN 

RECEP Fl 1; 1. 

DR 

PROSITE; 

PS50262 ; 

G PROTEIN 

S-> X- X W-' X. X— 1 111 

RECEP Fl 2 ; 1 

KW 

G— protein 

couoled 

recent o r : 

x x a. ii oiLiv^iiLkJ i.auc / VJX, jr w £-/ X W v3 _l_ 1 1 / iJ UJ. La L J. Ull • 
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7 (POTENTIAL) . 
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DOMAIN 
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FT 

DISULFID 

101 

178 

BY SIMILARITY. 

FT 

MOD_RES 

3 

3 

SULFATION (BY SIMILARITY) . 

FT 

MOD RES 

10 

10 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

14 

14 

SULFATION (BY SIMILARITY) . 

FT 

MOD RES 

15 

15 

SULFATION (BY SIMILARITY) . 

FT 

CARBOHYD 

268 

268 

N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT 

CONFLICT 

241 

241 

M -> I (IN REF. 3) . 

FT 

CONFLICT 

292 

292 

I -> M (IN REF. 3) . 

SQ 

SEQUENCE 

352 AA; 

40507 MW 

; 58B96C85909FACB2 CRC64; 


Query Match 45.5%; Score 363; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 6.4e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : II I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 


Qy 65 LCTRSQKEGLHYTC 78 

: 1 I I I : I I I I I I I 

Db 119 IILLTIDRYIxAIVHAVFALKARTVTFGWTSVITWWAVFASLPGIIFTRSQREGLHYTC 17 8 


Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I : : : I 

Db 179 SSHFPYSQYQFWKNFQTLKKV-ILGLVLPLLV1WICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 12 
CKR5_HYLML 

ID CKR5_HYLML STANDARD; PRT; 352 AA. 

AC Q95NC0; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Hylobates moloch (Silvery gibbon) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates. 

OX NCBI_TaxID=81572; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970 ; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AF177899; AAK43382.1; -. 

DR InterPro; IPR000276; GPCRJRhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 
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FT 

DISULFID 

101 

178 

BY SIMILARITY. 

FT 

MOD RES 

3 

3 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

10 

10 

SULFATION (BY SIMILARITY) . 

FT 

MOD RES 

14 

14 

SULFATION (BY SIMILARITY) . 

SQ 

SEQUENCE 

352 AA; 

40436 

MW; 9623CA98340CF274 CRC64; 


Query Match 45.4%; Score 362; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 8e-28; 

Matches 96; Conservative 8; Mismatches 2 9; Indels 150; Gaps 7 

Qy 1 MDYQVS S P I YDI N Y YT SEP CQKINVKQIAA 30 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I 
Db 1 MDYQVS S PT YDI DYYTS GPCQKINVKQI AARLLPPLYSLVFI FGFVGNMLVI LI LINCKR 60 

Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTD I YLLNLAI S DL FFLLT VP FWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAIVimVFALKARTWFGvVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

QY 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I! I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 13 
CKR5_CERPY 

ID CKR5__CERPY STANDARD; PRT; 352 AA. 

AC Q9TV42; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Cercopithecus pygerythrus (Vervet monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae ; Cercopithecus . 

OX NCBIJTaxID=60710; 
RN [1] 

RP SEQUENCE FROM N.A. 


RC STRAIN=100; 

RX MEDLINE=99335215; PubMed=10408730 ; 

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.-C, 

RA Diop 0., Rigoulet J., Barre-Sinoussi F. f Fomsgaard A.; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF035222; AAD44015.1; -. 

DR InterPro; IPR000276; GPCR__Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 


DR 

PROSITE; 

PS00237; 

G_PROTEIN_ 

RECEP_F1_1 ; 1 . 

DR 

PROSITE; 

PS50262; 

G_PROTEIN_ 

RECEP_F1_2 ; 1 . 

KW 

G-protein 

coupled 

receptor; 

Transmembrane; Glycoprotei 
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FT 
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10 

SULFATION (BY SIMILARITY) 

FT 

MOD_RES 

15 

15 

SULFATION (BY SIMILARITY) 

SQ 

SEQUENCE 

352 AA; 

40558 MW 

; EF17D67CBCCC3DB0 CRC64; 


Query Match 45.3%; Score 361; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 9.9e-28; 

Matches 95; Conservative 9; Mismatches 29; Indels 150; Gaps 7; 


Qy 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAA- 
I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 


30 


Db 


1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 


Qy 31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

1 I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNIAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 65 LCTRSQKEGLHYTC 78 

: 1111:11111 I 

Db 119 IILLTIDRYIAIVHAVFALKARTVTFGVVTSVITWWAVFASLPRIIFTRSQREGLHYAC 178 

Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I II I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I 11 I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 14 
CKR5_CERTO 

ID CKR5_CERTO STANDARD; PRT; 352 AA. 

AC 062743; 062744; 062745; 062746; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Cercocebus torquatus atys (Red-crowned mangabey) (Sooty mangabey). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Cercocebus. 

OX NCBI_TaxID=9531 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Isolate 079, 085, 087, and 089; 

RX MEDLINE=98321155; PubMed=965699 9 ; 

RA Chen Z., Gettie A., Ho D.D., Marx P. A.; 

RT "Primary SIVsm isolates use the CCR5 coreceptor from sooty mangabeys 

RT naturally infected in west Africa: a comparison of coreceptor usage 

RT of primary SIVsm, HIV-2, and SIVmac."; 

RL Virology 246:113-124(1998). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 


cc 

DR EMBL; AF051902; AAC39830.1; 

DR EMBL; AF051903; AAC39831.1; -. 

DR EMBL; AF051904; AAC39832.1; -. 

DR EMBL; AF051905; AAC39833.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 
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FT 
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10 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

14 

14 

SULFATION (BY SIMILARITY) . 

FT 

MOD RES 

15 

15 

SULFATION (BY SIMILARITY) . 

FT 

VARIANT 

2 

2 

D -> E (IN ISOLATE 087) . 

FT 

VARIANT 

3 

3 

Y -> D (IN ISOLATE 079) . 

FT 

VARIANT 

25 

25 

V -> G (IN ISOLATE 087) . 

FT 

VARIANT 

100 

100 

M -> K (IN ISOLATE 079) . 

FT 

VARIANT 

107 

107 

L -> V (IN ISOLATE 089) . 

FT 

VARIANT 

134 

134 

V -> G (IN ISOLATE 079) . 

FT 

VARIANT 

146 

146 

V -> L (IN ISOLATES 085 AND 

FT 

VARIANT 

340 

340 

T -> I (IN ISOLATE 079) . 

SQ 

SEQUENCE 

352 AA; 

40489 

MW; 20A196E2D47E49CA CRC64; 


Query Match 45.3%; Score 361; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 9.9e-28; 

Matches 95; Conservative 9; Mismatches 29; Indels 150; Gaps 7; 
Qy 1 MD YQVS S P I YD I N Y YT S E P CQK I N VKQ I AA 30 


i i i i i i i i i i i • i i i i i i i i i i i i i i i i i 
1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 



Db 


QY 


31 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 


Db 


61 LKSMTDI YLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 


Db 


65 LCTRSQKEGLHYTC 7 8 

: 1111:1111111 

119 1 1 LLT I DRYIxAI VHAVFALKART VT FGWT S VI TWWAVFAS L P GI I FT RSQREGLH YTC 17 8 


Qy 79 SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I i II II I I I I I I i : : I 

Db 179 SPHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 106 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I I I I I I I I I I II I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 15 
CKR5_HYLLE 

ID CKR5_HYLLE STANDARD; PRT; 352 AA. 

AC 097883; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5 ) . 

GN CCR5 OR CMKBR5. 

OS Hylobates leucogenys (White-cheeked gibbon) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates. 

OX NCBI_TaxID=61853; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970 ; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AF075451; AAD19863.1; 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 
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SQ 

SEQUENCE 

352 AA; 

40445 

MW; 4F8E4F344CEB7C91 CRC64; 


Query Match 45.0%; Score 359; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.6e-27; 

Matches 96; Conservative 8; Mismatches 29; Indels 150; Gaps 

MDYQVSSPIYDINYYTSEPCQKINVKQIAA 30 

I I I I I I I I I I I : I I I II I I I I I I I I I I I 
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179 

Qy 
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Db 

239 


60 


-YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 64 

| | I I I I I I I I I I I I I : I : : I 


LCTRSQKEGLHYTC 7 8 

: I I I II I I I I I I I 

IILLTIDRYLAIVmVFALKARTWFGWTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

SSHFPYSQYQFWKNFQTLKI HQRVHGG 105 

I I I I I I I I I I I I I I I I II I I :: I 

SSHFPYSQYQFWKNFQTLKI VI LGLVTjPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 23 8 

GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 135 

I I I I I I II I I I I I I I I I I : I 

TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


Search completed: March 5, 2004, 16:23:42 
Job time : 19.7407 sees 


